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NOVEL MEMBERS OF THE CAPSAICINS ANILLOID RECEPTOR FAMILY 

OF PROTEINS AND USES THEREOF 

Background of the Invention 

5 Pain is initiated when the peripheral terminals of a subgroup of sensory neurons 

are activated by noxious chemical, mechanical or thermal stimuli. These neurons, called 
nociceptors, transmit information regarding tissue damage to pain-processing centres in 
the spinal chord and brain (Fields, H.L. Pain, McGraw-Hill, New York, 1987). 
Nociceptors are characterized in part, by their sensitivity to capsaicin, a vanilloid- 

10 containing compound, and a natural product of capsicum peppers that is the active 
ingredient of many "hot" and spicy foods. In mammals, exposure of nociceptor 
terminals to capsaicin leads initially to excitation of the neuron and the consequent 
perception of pain and local release of inflammatory mediators. With prolonged 
exposure, nociceptor terminals become insensitive to capsaicin, as well as to other 

15 noxious stimuli (Szolcsanyi, J. in Capsaicin in the Study of Pain (ed. Wood, J.) 1-26 
(Academic, London, 1993). This latter phenomenon of nociceptor desensitization 
underlies the seemingly paradoxical use of capsaicin as an analgesic agent in the 
treatment of painful disorders ranging from viral and diabetic neuropathies to 
rheumatoid arthritis (Campbell, E. in Capsaicin and the Studyof Pain (ed. Wood, J.) 

20 255-272 (Academic. London, 1993); Szallasi, A. et al (1996) Pain 68, 195-208). Some 
of this decreased sensitivity to noxious stimuli may result from reversible changes in the 
nociceptor, but the long-term loss of responsiveness can be explained by death of the 
nociceptor or destruction of its peripheral terminals following exposure to capsaicin 
(Jancso, G. et al (1977) Nature 270, 741-743). 

25 The cellular specificity of capsaicin action and its ability to evoke the sensation 

of burning pain have led to speculation that the target of capsaicin action plays an 
important physiological role in the detection of painful stimuli. Indeed, capsaicin may 
elicit the perception of pain by mimicking the actions of a physiological stimulus or an 
endogenous ligand produced during tissue injury (James, I.F., Kinkina, N.N. & Wood, 

30 J.N. in Capsaicin in the Study of Pain (ed. Wood, J.N.) 83-104 (Academic, London, 
1993). 
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Caterina M.J. et al. have recently determined the molecular basis underlying this 
phenomenon by characterizing a functional cDNA that encodes a vanilloid receptor 
(VR-1) in rat sensory ganglia (Caterina M. J. et al, (1997) Nature 389:816-824). VR-1 
is a vanilloid-gated, nonselective cation channel that resembles members of the transient 
5 receptor potential (TRP) channel family, first identified as components of the 

Drosophila phototransduction pathway (Montell et ai (1989) Neuron 2:1313-1323). 
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Summary of the Invention 

The present invention is based, at least in part, on the discovery of novel 
members of the Capsaicin/Vanilloid family of receptors. Described herein is the 

10 isolation of the human orthologue of rat VR-1 (rVR-1), referred to herein as hVR-1, as 
well as another previously unknown member of the VR family of receptors, referred 
herein as VR-2, and specifically as human VR-2 (hVR-2, including an alternate form 
which contains a deletion) and rat VR-2 (rVR-2) nucleic acid and protein molecules. 
The hVR-1, hVR-2, and rVR-2 molecules of the present invention are useful as targets 

15 for developing modulating agents to regulate a variety of cellular processes, e.g., cellular 
processes involved in pain. Accordingly, in one aspect, this invention provides isolated 
nucleic acid molecules encoding hVR-1, hVR-2, and rVR-2 proteins and fragments 
thereof, as well as nucleic acid fragments suitable as primers or hybridization probes for 
the detection of h VR-1, hVR-2, and rVR-2-encoding nucleic acids. 

20 In one embodiment, an hVR-1, hVR-2, or rVR-2 nucleic acid molecule of the 

invention is at least 60%, 65%, 70%, 75%, 80%, 83%, 85%, 86%, 87%, 88%, 89%, 
90%>, 91%, 92%, 93%, 94%>, 95%, 96%, 97%, 98%, 99% or more identical to the 
nucleotide sequence (e.g., to the entire length of the nucleotide sequence) shown in SEQ 
ID NO:l, 3, 4, 6, 7, 9, 10, or 12 or a complement thereof. 

25 In another embodiment, the isolated nucleic acid molecule includes the 

nucleotide sequence shown SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12, or a complement 
thereof. In another embodiment, the nucleic acid molecule includes at least 10, 15, 20, 
or more contiguous nucleotides of SEQ ID NO: 1, 3, 4, 6, 7, 9, 10, or 12. 
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In another embodiment, an hVR-1, hVR-2, and rVR-2 nucleic acid molecule 
includes a nucleotide sequence encoding a protein having an amino acid sequence 
sufficiently homologous to the amino acid sequence of SEQ ID NO:2, 5, 8, or 1 1. In 
one embodiment, an hVR-1, hVR-2, and rVR-2 nucleic acid molecule includes a 
5 nucleotide sequence encoding a protein having an amino acid sequence at least 60%, 
65%, 70%, 75%, 80%, 85%, 87%, 90%, 95%, 98% or more identical to the entire length 
of the amino acid sequence of SEQ ID NO:2, 5, 8, or 1 1 . 

Another embodiment of the invention features nucleic acid molecules, preferably 
hVR-1, hVR-2, and rVR-2 nucleic acid molecules, which specifically detect hVR-1, 
10 hVR-2, and rVR-2 nucleic acid molecules relative to nucleic acid molecules encoding 
non-hVR-1, non-hVR-2, and non-hVR-2 proteins. For example, in one embodiment, 
such a nucleic acid molecule is at least 100-150, 1 150-200, 200-250, 250-300, 300-350, 
350-400, 400-450, 450-500, 500-550, 550-600, 600-700, 700-800, 800-900, 900-1000, 
1088, or more nucleotides in length and hybridizes under stringent conditions to a 
15 nucleic acid molecule comprising the nucleotide sequence shown in SEQ ID NO:l, 3, 4, 
6, 7, 9, 10, or 12. In preferred embodiments, the nucleic acid molecules are at least 15 
(e.g., contiguous) nucleotides in length and hybridize under stringent conditions to 
nucleotides 1-17, 3696-3863, or 3901-3909 of SEQ ID NO:l. In other preferred 
embodiments, the nucleic acid molecules comprise nucleotides 1-17, 3696-3863, or 
20 3901-3909 of SEQ ID NO:l. In yet other preferred embodiments, the nucleic acid 

molecules consist of nucleotides 1-17, 3696-3863, or 3901-3909 of SEQ ID NO:l. In 
preferred embodiments, the nucleic acid molecules are at least 15 (e.g.. contiguous) 
nucleotides in length and hybridize under stringent conditions to nucleotides 1944-2003 
of SEQ ID NO:4. In other preferred embodiments, the nucleic acid molecules comprise 
25 nucleotides 1 944-2003 of SEQ ID NO:4. In yet other preferred embodiments, the 
nucleic acid molecules consist of nucleotides 1944-2003 of SEQ ID NO:4. 

In other embodiments, the nucleic acid molecule encodes a naturally occurring 
allelic variant of a polypeptide comprising the amino acid sequence of SEQ ID NO:2, 5, 
8, or 1 1 , wherein the nucleic acid molecule hybridizes to a nucleic acid molecule 
30 consisting of SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12 under stringent conditions and is 
encoded by the same locus as hVR-1, hVR-2 or rVR-2. 
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Another embodiment of the invention provides a nucleic acid molecule that 
encodes a naturally occurring orthologue of a polypeptide comprising the amino acid 
sequence of SEQ ID NO:2, 5, 8, or 1 1, wherein the nucleic acid molecule hybridizes to a 
nucleic acid molecule consisting of SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12 under stringent 
5 conditions. 

Another embodiment of the invention provides an isolated nucleic acid molecule 
which is antisense to an hVR-1, hVR-2, and rVR-2 nucleic acid molecule, e.g., the 
coding strand of an hVR-1, hVR-2, and rVR-2 nucleic acid molecule. 

Since the hVR2 (the alternate form) and rVR2 sequences represent fragments of 

1 0 the entire coding regions of these genes, another embodiment of the invention provides 
the complete gene sequences. A skilled artisan can readily isolate such molecule using 
the sequences disclosed herein. 

Another aspect of the invention provides a vector comprising an hVR-1, an hVR- 
2, or a rVR-2 nucleic acid molecule. In certain embodiments, the vector is a 

1 5 recombinant expression vector. In another embodiment, the invention provides a host 
cell containing a vector of the invention. In yet another embodiment, the invention 
provides a host cell containing a nucleic acid molecule of the invention. The invention 
also provides a method for producing a protein, preferably an hVR-1, hVR-2, and rVR-2 
protein, by culturing in a suitable medium, a host cell, e.g., a mammalian host cell such 

20 as a non-human mammalian cell, of the invention containing a recombinant expression 
vector, such that the protein is produced. 

Another aspect of this invention features isolated or recombinant hVR-1, hVR-2, 
and rVR-2 proteins and polypeptides. In one embodiment, the isolated protein, 
preferably an hVR-1, hVR-2 ? or rVR-2 protein, includes at least one transmembrane 

25 domain. In another embodiment, the isolated protein, preferably an hVR-1, hVR-2, or 
rVR-2 protein, includes at least one transmembrane domain and at least one proline rich 
domain. In yet another embodiment, the isolated protein, preferably an hVR-1 , hVR-2, 
or rVR-2 protein, includes at least one transmembrane domain, at least one proline rich 
domain, and at least one ankyrin repeat domain. In yet another embodiment, the protein, 

30 preferably an hVR-1 , hVR-2, or rVR-2 protein, includes at least one transmembrane 

domain, at least one proline rich domain, and at least one ankyrin repeat domain and has 
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an amino acid sequence at least about 60%, 65%, 70%, 75%, 80%, 85%, 87%, 90%, 
95%, 98% or more homologous to the amino acid sequence of SEQ ID NO:2, 5, 8, or 
1 1. In another embodiment, the protein, preferably an hVR-1, hVR-2, or rVR-2 protein, 
includes at least one transmembrane domain, at least one proline rich domain, and at 
5 least one ankyrin repeat domain and plays a role in the development and regulation of 
pain. In yet another embodiment, the protein, preferably an hVR-1, hVR-2, and rVR-2 
protein, includes at least one transmembrane domain, at least one proline rich domain, 
and at least one ankyrin repeat domain and is encoded by a nucleic acid molecule having 
a nucleotide sequence which hybridizes under stringent hybridization conditions to a 
10 nucleic acid molecule comprising the nucleotide sequence of SEQ ID NO:l, 3, 4, 6, 7, 9, 
10, or 12. 

In another embodiment, the invention features fragments of the protein having 
the amino acid sequence of SEQ ID NO:2, 5, 8, or 1 1, wherein the fragment comprises 
at least 15, 30, 40, 50, 60, 70, 80, 90, or 100 amino acids (e.g., contiguous amino acids). 

15 In another embodiment, the invention features an isolated protein, preferably an 

hVR-1, hVR-2, and rVR-2 protein, which is encoded by a nucleic acid molecule 
consisting of a nucleotide sequence at least about 60%, 65%, 70%, 75%, 80%, 83%, 
85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 
more homologous to a nucleotide sequence of SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12, or a 

20 complement thereof. This invention further features an isolated protein, preferably an 
hVR-1, hVR-2, or rVR-2 protein, which is encoded by a nucleic acid molecule 
consisting of a nucleotide sequence which hybridizes under stringent hybridization 
conditions to a nucleic acid molecule consisting of the nucleotide sequence of SEQ ID 
NO:l, 3, 4, 6, 7, 9, 10, or 12, or a complement thereof. 

25 The proteins of the present invention or portions thereof, e.g., biologically active 

portions thereof, can be operatively linked to a non-hVR-1, non-hVR-2, or non-rVR-2 
polypeptide {e.g., heterologous amino acid sequences) to form fusion proteins. The 
invention further features antibodies, such as monoclonal or polyclonal antibodies, that 
specifically bind proteins of the invention, preferably hVR-1, hVR-2, and rVR-2 

30 proteins. In addition, the hVR-1, hVR-2, and rVR-2 proteins or biologically active 
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portions thereof can be incorporated into pharmaceutical compositions, which optionally 
include pharmaceutical^ acceptable carriers. 

In another aspect, the present invention provides a method for detecting the 
presence of an hVR-1, hVR-2, and rVR-2 nucleic acid molecule, protein or polypeptide 
5 in a biological sample by contacting the biological sample with an agent capable of 
detecting an hVR-1, hVR-2, and rVR-2 nucleic acid molecule, protein or polypeptide 
such that the presence of an hVR-1, hVR-2, and rVR-2 nucleic acid molecule, protein or 
polypeptide is detected in the biological sample. 

In another aspect, the present invention provides a method for detecting the 
Id presence of hVR-1, hVR-2, and rVR-2 activity in a biological sample by contacting the 
biological sample with an agent capable of detecting an indicator of hVR-1, hVR-2, and 
rVR-2 activity such that the presence of hVR-1, hVR-2, and rVR-2 activity is detected 
in the biological sample. 

In another aspect, the invention provides a method for modulating hVR-1, hVR- 

1 5 2. and rVR-2 activity comprising contacting a cell capable of expressing hVR-1, hVR-2, 
and rVR-2 with an agent that modulates hVR-1, hVR-2, and rVR-2 activity such that 
hVR-1, hVR-2, and rVR-2 activity in the cell is modulated. In one embodiment, the 
agent inhibits hVR-1, hVR-2, and rVR-2 activity. In another embodiment, the agent 
stimulates hVR-1, hVR-2, and rVR-2 activity. In one embodiment, the agent is an 

20 antibody that specifically binds to an hVR-1, hVR-2, and rVR-2 protein. In another 
embodiment, the agent modulates expression of hVR-1, hVR-2, and rVR-2 by 
modulating transcription of an hVR-1, hVR-2, and rVR-2 gene or translation of an hVR- 
1, hVR-2, and rVR-2 mRNA. In yet another embodiment, the agent is a nucleic acid 
molecule having a nucleotide sequence that is antisense to the coding strand of an hVR- 

25 1 , h VR-2, and rVR-2 mRNA or an hVR- 1 , hVR-2, and rVR-2 gene. 

In one embodiment, the methods of the present invention are used to treat a 
subject having a disorder characterized by aberrant hVR-1, hVR-2, and rVR-2 protein or 
nucleic acid expression or activity by administering an agent which is an hVR-1, hVR-2, 
and rVR-2 modulator to the subject. In one embodiment, the hVR-1, hVR-2, and rVR-2 

30 modulator is an h VR- 1 , hVR-2, and rVR-2 protein. In another embodiment the hVR- 1 , 
hVR-2, and rVR-2 modulator is an hVR-1, hVR-2, and rVR-2 nucleic acid molecule. In 
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yet another embodiment, the hVR-1, hVR-2, and rVR-2 modulator is a peptide, 
peptidomimetic, or other small molecule. In a further embodiment, the disorder 
characterized by aberrant hVR-l ? hVR-2 ? and rVR-2 protein or nucleic acid expression 
is a pain disorder, e.g., hyperalgesia. 
5 The present invention also provides a diagnostic assay for identifying the 

presence or absence of a genetic alteration characterized by at least one of (i) aberrant 
modification or mutation of a gene encoding an hVR-1, hVR-2, and rVR-2 protein; (ii) 
mis-regulation of the gene; and (iii) aberrant post-translational modification of an hVR- 
1, hVR-2, and rVR-2 protein, wherein a wild-type form of the gene encodes a protein 
1 0 with an hVR-1 , hVR-2, and rVR-2 activity (as described herein). 

In another aspect the invention provides a method for identifying a compound 
that binds to or modulates the activity of an hVR-1, hVR-2, and rVR-2 protein, by 
providing an indicator composition comprising an hVR-1, hVR-2, and rVR-2 protein 
having hVR-K hVR-2, and rVR-2 activity, contacting the indicator composition with a 
1 5 test compound, and determining the effect of the test compound on hVR- 1 , h VR-2, and 
rVR-2 activity in the indicator composition to identify a compound that modulates the 
activity of an hVR-1, hVR-2, and rVR-2 protein. 

Other features and advantages of the invention will be apparent from the 
following detailed description and claims. 

20 

Brief Description of the Drawings 

Figure 1 depicts the full length cDNA sequence and predicted amino acid 
sequence of human VR-1 (hVR-1). The nucleotide sequence corresponds to nucleic 
acids 1 to 3909 of SEQ ID NO:l. The amino acid sequence corresponds to amino acids 

25 1 to 839 of SEQ ID NO:2. The coding region without the 5' and 3 f untranslated regions 
of the human VR-1 (h VR-1) gene is shown in SEQ ID NO:3. 

Figure 2 depicts the full length cDNA sequence and predicted amino acid 
sequence of human VR-2 (hVR-2). The nucleotide sequence corresponds to nucleic 
acids 1 to 2809 of SEQ ID NO:4. The amino acid sequence corresponds to amino acids 

30 1 to 764 of SEQ ID NO:5. The coding region without the 5' and 3* untranslated regions 
of the human VR-2 (hVR-2) gene is shown in SEQ ID NO:6. 
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Figure 3 depicts the partial cDNA sequence and partial predicted amino acid 
sequence of an alternate form of human VR-2 (hVR-2). The nucleotide sequence 
corresponds to nucleic acids 1 to 1489 of SEQ ID NO:7. The amino acid sequence 
corresponds to amino acids 1 to 436 of SEQ ID NO: 8. The coding region without the 5' 
5 and 3* untranslated regions of the alternate form of human VR-2 (hVR-2) gene is shown 
in SEQ ID NO:9. 

Figure 4 depicts the partial cDNA sequence and partial predicted amino acid 
sequence of rat VR-2 (rVR-2). The nucleotide sequence corresponds to nucleic acids 1 
to 1794 of SEQ ID NO: 10. The amino acid sequence corresponds to amino acids 1 to 

10 554 of SEQ ID NO: 1 1 . The coding region without the 5* and 3' untranslated regions of 
the rat VR-2 (rVR-2) gene is shown in SEQ ID NO: 12. 

Figure 5 depicts an alignment of the hVR-1 protein (SEQ ID NO:2) with the 
human VR-2 protein (SEQ ID NO:5) using the GAP program in the GCG software 
package (Blosum 62 matrix) and a gap weight of 12 and a length weight of 4. 

15 Figure 6 depicts an alignment of the hVR-1 nucleotide sequence (SEQ ID NO: 1) 

with the human VR-2 nucleotide sequence (SEQ ID NO:4) using the GAP program in 
the GCG software package (nwsgapdna matrix) and a gap weight of 50 and a length 
weight of 3. 

Figure 7 depicts an alignment of the hVR-2 protein (SEQ ID NO: 5) with the rat 
20 VR-2 protein (SEQ ID NO:l 1) using the CLUSTAL W (1.74) multiple sequence 
alignment program. 

Figure 8 depicts an alignment of the hVR-2 protein (SEQ ID NO:5) with the rat 
VR-2 protein (SEQ ID NO:l 1) using the GAP program in the GCG software package 
(Blosum 62 matrix) and a gap weight of 12 and a length weight of 4. 
25 Figure 9 depicts an alignment of the hVR- 1 nucleotide sequence (SEQ ID NO: 1 ) 

with the rat VR-1 nucleotide sequence (Accession Number: AF0293 1 0) using the GAP 
program in the GCG software package (nwsgapdna matrix) and a gap weight of 50 and a 
length weight of 3. 

Figure 10 depicts an alignment of the hVR-1 protein (SEQ ID NO:2) with the rat 
30 VR-1 protein (Accession Number: AF0293 10) using the GAP program in the GCG 

software package (Blosum 62 matrix) and a gap weight of 12 and a length weight of 4, 
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Figure 11 depicts an alignment of the hVR-2 protein (SEQ ID NO:5) with the 
human VR-2 protein (alternate form) (SEQ ID NO:8) using the CLUSTAL W (1.74) 
multiple sequence alignment program. 

Figure 12 depicts a structural, hydrophobicity, and antigenicity analysis of the 
5 hVR-1 protein. 

Figure 13 depicts the results of a search using the amino acid sequence of the 
hVR-1 protein against the HMM database. 

Figure 14 depicts a structural, hydrophobicity, and antigenicity analysis of the 
hVR-2 protein. 

10 Figure 15 depicts the results of a search using the amino acid sequence of the 

hVR-2 protein against the HMM database. 

Figure 16 depicts the predicted full length amino acid sequence of the human 
VR-2 protein (alternate form) (SEQ ID NO:20). 

Figure 7 7 depicts an alignment of the hVR-2 protein (SEQ ID NO:5) with the 
1 5 predicted full length human VR-2 protein (alternate form) (SEQ ID NO:20) using the 
CLUSTAL W (1 .74) multiple sequence alignment program. 

Detailed Description of the Invention 

The present invention is based, at least in part, on the discovery of nucleic acid 
20 and amino acid molecules which are novel members of the Capsaicin/Vanilloid family 
of receptors. Described herein is the isolation of the human orthologue of rat VR-1 
(rVR-1), referred to herein as hVR-1, as well as another previously unknown member of 
the VR family of receptors, referred herein as VR-2, and specifically as human VR-2 
(hVR-2) and rat VR-2 (rVR-2) nucleic acid and protein molecules. The hVR-L hVR-2, 
25 and rVR-2 molecules were identified based on their sequence similarity to the known rat 
vanilloid receptor (VR-1). VR-1 is a vanilloid gated, non-selective cation channel 
which resembles members of the transient receptor potential (TRP) ion channel family 
(described in Montell et al (1989) Neuron 2:1313-1323) that mediate the influx of 
extracellular calcium in response to depletion of intracellular calcium stores. The rat 
30 VR-1 cDNA contains an open reading frame of 25 1 4 nucleotides that encodes a protein 
of 838 amino acids. Hydrophilicity analysis has indicated that rat VR-1 contains six 
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transmembrane domains (predicted to be mostly ct-helices) with an additional short 
hydrophobic stretch between transmembrane regions 5 and 6. The amino terminal 
hydrophilic segment contains a relatively proline rich region followed by three ankyrin 
repeat domains. The rat VR-1 is expressed in small diameter neurons within sensory 
5 ganglia. The present hVR-1 sequence is the human orthologue of rVR-1. As described 
in further detail infra, the human VR-1 is expressed in nodose, trigeminal sensory 
neurons, as well as in some, but not all, small dorsal root ganglion (DRG) neurons and 
in a few medium sized DRG neurons. 

The hVR-1, hVR-2, and rVR-2 molecules of the present invention play a role in 

10 pain signaling mechanisms. As used herein, the term "pain signaling mechanisms" 
includes the cellular mechanisms involved in the development and regulation of pain, 
e.g., pain elicited by noxious chemical, mechanical, or thermal stimuli, in a subject, e.g., 
a mammal such as a human. In mammals, the initial detection of noxious chemical, 
mechanical, or thermal stimuli, a process referred to as "nociception", occurs 

15 predominantly at the peripheral terminals of specialized, small diameter primary afferent 
neurons, called polymodal nociceptors. These afferent neurons transmit the information 
to the central nervous system, evoking a perception of pain or discomfort and initiating 
appropriate protective reflexes. Capsaicin/V anilloid receptors, e.g., the hVR-1, hVR-2, 
and rVR-2 molecules of the present invention, present on these afferent neurons, are 

20 involved in detecting these noxious chemical, mechanical, or thermal stimuli and 

transducing this information into membrane depolarization events. Thus, the hVR-1, 
hVR-2, and rVR-2 molecules by participating in pain signaling mechanisms, can 
modulate pain elicitation and provide novel diagnostic targets and therapeutic agents to 
control pain. 

25 The hVR-1, hVR-2, and rVR-2 molecules provide novel diagnostic targets and 

therapeutic agents to control pain in a variety of disorders, diseases, or conditions which 
are characterized by a deregulated, e.g., upregulated or downregulated, pain response. 
For example, the hVR-1, hVR-2, and rVR-2 molecules provide novel diagnostic targets 
and therapeutic agents to control the exaggerated pain response elicited during various 

30 forms of tissue injury, e.g., inflammation, infection, and ischemia, usually referred to as 
hyperalgesia (described in, for example, Fields, H.L. (1987) Pain, New York:McGraw- 
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Hill). Moreover, the hVR-1, hVR-2, and rVR-2 molecules provide novel diagnostic 
targets and therapeutic agents to control pain associated with muscoloskeletal disorders, 
e.g., joint pain; tooth pain; headaches; pain associated with surgery, or neuropathic pain. 
As the hVR-1 gene maps to a region of human chromosome 17 between WI- 
5 5436 (7.7cR) and WI-6584 (18.9cR) (Example 6), which has been associated with 

myasthenia gravis, Smith-Magenis syndrome, CORDS, Cone-rod dysrtophy, and breast 
cancer, the hVR-1 molecule may provide novel diagnostic targets and therapeutic agents 
to treat, diagnose, or prognose these disorders or other disorders linked to this 
chromosomal region. Similarly, as the hVR-2 gene maps to a region of human 

10 chromosome 17 between AFMA043ZB5 (23.3 cR) and D17S721 (29.3cR) (Example 6) 
which has been associated with myasthenia gravis. Smith-Magenis syndrome. CORD5, 
Cone-rod dysrtophy, choroidal dystrophy, central areolar, and retinal cone dystrophy, 
the hVR-2 molecule may provide novel diagnostic targets and therapeutic agents to treat, 
diagnose, or prognose these disorders or other disorders linked to this chromosomal 

1 5 region. 

The term "family" when referring to the protein and nucleic acid molecules of 
the invention is intended to mean two or more proteins or nucleic acid molecules having 
a common structural domain or motif and having sufficient amino acid or nucleotide 
sequence homology as defined herein. Such family members can be naturally or non- 
20 naturally occurring and can be from either the same or different species. For example, a 
family can contain a first protein of human origin, as well as other, distinct proteins of 
human origin or alternatively, can contain homologues of non-human origin. Members 
of a family may also have common functional characteristics. 

For example, the family of hVR-1, hVR-2, and rVR-2 proteins comprise at least 
25 one, and preferably six "transmembrane domains." As used herein, the term 

"transmembrane domain" includes an amino acid sequence of about 1 5 amino acid 
residues in length which spans the plasma membrane. More preferably, a 
transmembrane domain includes about at least 20, 25, 30, 35, 40, or 45 amino acid 
residues and spans the plasma membrane. Transmembrane domains are rich in 
30 hydrophobic residues, and typically have a helical structure. In a embodiment, at least 
50%, 60%, 70%, 80%, 90%, 95% or more of the amino acid residues of a 
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transmembrane domain are hydrophobic, e.g., leucines, isoleucines, tyrosines, or 
tryptophans. Transmembrane domains are described in, for example, Zagotta W.N. et 
al, (1996) Annual Rev. Neurosci. 19: 235-63, the contents of which are incorporated 
herein by reference. Amino acid residues 434-455, 480-495, (509-53 1 ; based on 
5 homology to the rat VR- 1 ) or 5 1 4-53 1 , (543-569; based on homology to the rat VR- 1 ) or 
538-555, (577-596; based on homology to the rat VR-1) or 580-599, and (656-683; 
based on homology to the rat VR-1) or 658-682 of hVR-1 (SEQ ID NO:2) and amino 
acid residues 391-410, 431-448, 459-476, 486-508, 538-556, and 621-645 of hVR-2 
(SEQ ID NO:5) comprise transmembrane domains. 

10 In another embodiment, an hVR-1, hVR-2, and rVR-2 of the present invention is 

identified based on the presence of a "proline rich domain'" in the protein or 
corresponding nucleic acid molecule. As used herein, the term "proline rich domain" 
includes an amino acid sequence of about 4-6 amino acid residues in length having the 
general sequence X-Pro-X-X-Pro-X (where X can be any amino acid). Proline rich 

15 domains are usually located in a helical structure and bind through hydrophobic 
interactions to SH3 domains. SH3 domains recognize proline rich domains in both 
forward and reverse orientations. Proline rich domains are described in, for example, 
Sattler M. et al (1998) Leukemia 12:637-644, the contents of which are incorporated 
herein by reference. 

20 In another embodiment, an hVR-1, hVR-2. and rVR-2 of the present invention is 

identified based on the presence of an "ankyrin repeat domain''' in the protein or 
corresponding nucleic acid molecule. As used herein, the term "ankyrin repeat domain" 
includes a protein domain having an amino acid sequence of about 30-50 amino acid 
residues and having a bit score for the alignment of the sequence to the ankyrin repeat 

25 domain (HMM) of at least 6. Preferably, an ankyrin repeat domain includes at least 
about 30-45, more preferably about 30-40 amino acid residues, or about 30-35 amino 
acids and has a bit score for the alignment of the sequence to the ankyrin repeat domain 
(HMM) of at least 3-10, more preferably 10-30, more preferably 30-50, even more 
preferably 50-75, 75-100, 100-200 or greater. The ankyrin repeat domain HMM has 

30 been assigned the PFAM Accession PF00023 (http://genome.wustLedu/Pfam/.html). 
Ankyrin repeats are involved in protein-protein interactions and are described in, for 
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example, Ketchum K.A et ai (1996) FEBS Letters 378:19-26, the contents of which are 
incorporated herein by reference. 

To identify the presence of an ankyrin repeat domain in an hVR-L hVR-2, and 
rVR-2 protein and make the determination that a protein of interest has a particular 
5 profile, the amino acid sequence of the protein is searched against a database ofHMMs 
(e.g., the Pfam database, release 2.1) using the default parameters 
(http://vvww.sanger.ac.uk/Software/Pfam/HMM_search). A description of the Pfam 
database can be found in Sonhammer et ai (1997) Proteins 28(3)405-420 and a detailed 
description ofHMMs can be found, for example, in Gribskov et ai (1990) Meth. 
10 Enzymol 183:146-159; Gribskov et ai( 1987) Proc. Natl Acad. Sci. USA 84:4355- 
4358; BCrogh et ai ( 1 994) J. Moi Bioi 235:1501-1531; and Stultz et a/.(1993) Protein 
Sci. 2:305-3 14, the contents of which are incorporated herein by reference. A search 
was performed against the HMM database resulting in the identification of three ankyrin 
repeat domains in the amino acid sequence of SEQ ID NO:2 (at about residues 201-233, 
15 248-283, and 333-361) and SEQ ID NO:5 (at about residues 162-194, 208-243, and 293- 
328). The results of the searches are set forth in Figures 13 and 15. 

Isolated proteins of the present invention, preferably hVR-1, hVR-2, and rVR-2 
proteins, have an amino acid sequence sufficiently identical to the amino acid sequence of 
SEQ ID NO:2, 5, 8, or 1 1 or are encoded by a nucleotide sequence sufficiently identical to 

20 SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12. As used herein, the term "sufficiently identical" 

refers to a first amino acid or nucleotide sequence which contains a sufficient or minimum 
number of identical or equivalent (e.g., an amino acid residue which has a similar side 
chain) amino acid residues or nucleotides to a second amino acid or nucleotide sequence 
such that the first and second amino acid or nucleotide sequences share common structural 

25 domains or motifs and/or a common functional activity. For example, amino acid or 

nucleotide sequences which share common structural domains have at least 30%, 40%, or 
50% identity, preferably 60% identity, more preferably 70%-80%, and even more 
preferably 90-95% identity across the amino acid sequences of the domains and contain at 
least one and preferably two structural domains or motifs, are defined herein as sufficiently 

30 identical. Furthermore, amino acid or nucleotide sequences which share at least 30%, 40%, 
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or 50%, preferably 60%, more preferably 70-80%, or 90-95% identity and share a common 
functional activity are defined herein as sufficiently identical. 

As used interchangeably herein, an "hVR-1, hVR-2, and rVR-2 activity", "biological 
activity of hVR-1 , hVR-2, and rVR-2" or "functional activity of hVR-l ? hVR-2, and rVR-2", 
5 refers to an activity exerted by an hVR-1, hVR-2, and rVR-2 protein, polypeptide or nucleic 
acid molecule on an hVR-1, hVR-2, and rVR-2 responsive cell or on an hVR-1, hVR-2, and 
rVR-2 protein substrate, as determined in vivo, or in vitro, according to standard techniques. 
In one embodiment, an hVR-1, hVR-2, and rVR-2 activity is a direct activity, such as an 
association with an hVR-1, hVR-2, and rVR-2-target molecule. As used herein, a "target 

1 0 molecule" or "binding partner" is a molecule with which an hVR-1, hVR-2, and rVR-2 

protein binds or interacts in nature, such that hVR-1, hVR-2, and rVR-2-mediated function is 
achieved. An hVR-K hVR-2, and rVR-2 target molecule can be a non-hVR-L non-hVR-2, 
and non-rVR-2 molecule or an hVR-1, hVR-2, and rVR-2 protein or polypeptide of the 
present invention. In an exemplary embodiment, an hVR-1, hVR-2, and rVR-2 target 

15 molecule is an hVR-1, hVR-2, and rVR-2 ligand, e.g., capsaicin. Alternatively, an hVR-1, 
hVR-2, and rVR-2 activity is an indirect activity, such as a cellular signaling activity 
mediated by interaction of the hVR-1, hVR-2, and rVR-2 protein with an hVR-1, hVR-2, and 
rVR-2 ligand. 

Accordingly, another embodiment of the invention features isolated hVR-1, 
20 hVR-2, and rVR-2 proteins and polypeptides having an hVR-1 , hVR-2, and rVR-2 

activity. Other proteins of the invention are hVR-1, hVR-2, and rVR-2 proteins having 
at least one, and preferably six, transmembrane domains and, preferably, an hVR-1, 
hVR-2, and rVR-2 activity. Yet other proteins of the invention are hVR-1, hVR-2, and 
rVR-2 proteins having at least one transmembrane domain, at least one proline rich 
25 domain and, preferably, an hVR-1 , hVR-2, and rVR-2 activity. Other proteins of the 
invention are hVR-1, hVR-2, and rVR-2 proteins having at least one transmembrane 
domain, at least one proline rich domain, at least one ankyrin repeat domain and, 
preferably, an hVR-1, hVR-2, and rVR-2 activity. Additional proteins of the invention 
have at least one transmembrane domain, at least one proline rich domain, at least one 
30 ankyrin repeat domain, and are, preferably, encoded by a nucleic acid molecule having a 
nucleotide sequence which hybridizes under stringent hybridization conditions to a 
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nucleic acid molecule comprising the nucleotide sequence of SEQ ID NO:l, 3, 4, 6, 7, 9, 
10, or 12. 

The nucleotide sequence of the full length hVR-1 cDNA and the predicted 
amino acid sequence of the hVR-1 polypeptide are shown in Figure 1 and in SEQ ID 
5 NOs:l and 2, respectively. 

The nucleotide sequence of the full length hVR-2 cDNA and the predicted amino 
acid sequence of the hVR-2 polypeptide are shown in Figure 2 and in SEQ ID NOs:4 
and 5, respectively. 

The nucleotide sequence of the partial hVR-2 (alternate form) cDNA and the 
10 predicted amino acid sequence of the hVR-2 (alternate form) polypeptide are shown in 
Figure 3 and in SEQ ID NOs:7 and 8, respectively. 

The nucleotide sequence of the partial rVR-2 cDNA and the predicted amino 
acid sequence of the rVR-2 polypeptide are shown in Figure 4 and in SEQ ID NOs:10 
and 1 1, respectively. 

15 Various aspects of the invention are described in further detail in the following 

subsections: 

I. Isolated Nucleic Acid Molecules 

One aspect of the invention pertains to isolated nucleic acid molecules that 

20 encode hVR-1 , hVR-2, and rVR-2 proteins or biologically active portions thereof, as 

well as nucleic acid fragments sufficient for use as hybridization probes to identify hVR- 
1, hVR-2, and rVR-2-encoding nucleic acid molecules (e.g., hVR-1, hVR-2, and rVR-2 
mRNA) and fragments for use as PCR primers for the amplification or mutation of hVR- 
1, hVR-2, and rVR-2 nucleic acid molecules. As used herein, the term "nucleic acid 

25 molecule" is intended to include DNA molecules (e.g., cDNA or genomic DNA) and 
RNA molecules (e.g., mRNA) and analogs of the DNA or RNA generated using 
nucleotide analogs. The nucleic acid molecule can be single-stranded or double- 
stranded, but preferably is double-stranded DNA. 

The term "isolated nucleic acid molecule" includes nucleic acid molecules which 

30 are separated from other nucleic acid molecules which are present in the natural source 
of the nucleic acid. For example, with regards to genomic DNA, the term "isolated" 
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includes nucleic acid molecules which are separated from the chromosome with which 
the genomic DNA is naturally associated. Preferably, an "isolated" nucleic acid is free 
of sequences which naturally flank the nucleic acid (i.e., sequences located at the 5' and 
3' ends of the nucleic acid) in the genomic DNA of the organism from which the nucleic 
5 acid is derived. For example, in various embodiments, the isolated hVR-1 , hVR-2, and 
rVR-2 nucleic acid molecule can contain less than about 5 kb, 4kb, 3kb, 2kb, 1 kb, 0.5 
kb or 0.1 kb of nucleotide sequences which naturally flank the nucleic acid molecule in 
genomic DNA of the cell from which the nucleic acid is derived. Moreover, an 
"isolated 1 * nucleic acid molecule, such as a cDNA molecule, can be substantially free of 
10 other cellular material, or culture medium when produced by recombinant techniques, or 
substantially free of chemical precursors or other chemicals when chemically 
synthesized. 

A nucleic acid molecule of the present invention, e.g.^a nucleic acid molecule 
having the nucleotide sequence of SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12. Using all or 

15 portion of the nucleic acid sequence of SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12, as a 

hybridization probe, hVR-1, hVR-2, and rVR-2 nucleic acid molecules can be isolated 
using standard hybridization and cloning techniques (e.g., as described in Sambrook, J., 
Fritsh, E. F., and Maniatis, T. Molecular Cloning; A Laboratory Manual. 2nd, ed., Cold 
Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 

20 NY, 1989). 

Moreover, a nucleic acid molecule encompassing all or a portion of SEQ ID 
NO:l, 3, 4, 6, 7, 9, 10, or 12, can be isolated by the polymerase chain reaction (PCR) 
using synthetic oligonucleotide primers designed based upon the sequence of SEQ ID 
NO:l,3, 4, 6, 7, 9, 10, or 12. 

25 A nucleic acid of the invention can be amplified using cDNA, mRNA or 

alternatively, genomic DNA, as a template and appropriate oligonucleotide primers 
according to standard PCR amplification techniques. The nucleic acid so amplified can 
be cloned into an appropriate vector and characterized by DNA sequence analysis. 
Furthermore, oligonucleotides corresponding to hVR-1, hVR-2, and rVR-2 nucleotide 

30 sequences can be prepared by standard synthetic techniques, e.g., using an automated 
DNA synthesizer. 
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In one embodiment, an isolated nucleic acid molecule of the invention comprises 
the nucleotide sequence shown in SEQ ID NO: 1 . The sequence of SEQ ID NO: 1 
corresponds to the full length hVR-1 encoding cDNA. 

In another embodiment, an isolated nucleic acid molecule of the invention 
5 comprises the nucleotide sequence shown in SEQ ID NO:4. The sequence of SEQ ID 
NO:4 corresponds to the full length hVR-2 encoding cDNA. 

In another embodiment, an isolated nucleic acid molecule of the invention 
comprises the nucleotide sequence shown in SEQ ID NO:7. The sequence of SEQ ID 
NO:7 corresponds to a fragment of the hVR-2 (alternate form) encoding cDNA. 
10 In another embodiment, an isolated nucleic acid molecule of the invention 

comprises the nucleotide sequence shown in SEQ ID NO: 10. The sequence of SEQ ID 
NO: 10 corresponds to a fragment of the rVR-2 cDNA. 

In another embodiment, an isolated nucleic acid molecule of the invention 
comprises a nucleic acid molecule which is a complement of the nucleotide sequence 
15 shown in SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12, or a portion of any of these nucleotide 
sequences. A nucleic acid molecule which is complementary to the nucleotide sequence 
shown in SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12, is one which is sufficiently 
complementary to the nucleotide sequence shown in SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 
12, such that it can hybridize to the nucleotide sequence shown in SEQ ID NO:l, 3, 4, 6, 
20 7, 9, 10, or 12 thereby forming a stable duplex. 

In still another embodiment, an isolated nucleic acid molecule of the present 
invention comprises a nucleotide sequence which is at least about 60%, 65%, 70%, 75%, 
80%, 83%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 
98%, 99% or more homologous to the entire length of the nucleotide sequence shown in 
25 SEQ ID NO: 1, 3, 4, 6, 7, 9, 10, or 12, or a portion of any of these nucleotide sequences. 

Moreover, the nucleic acid molecule of the invention can comprise only a portion 
of the nucleic acid sequence of SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12, for example, a 
fragment which can be used as a probe or primer or a fragment encoding a portion of an 
hVR-1, hVR-2, and rVR-2 protein, e.g., a biologically active portion of an hVR-1, hVR- 
30 2, and rVR-2 protein. The nucleotide sequence determined from the cloning of the 
hVR-1, hVR-2, and rVR-2 gene allows for the generation of probes and primers 
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designed for use in identifying and/or cloning other hVR-1, hVR-2. and rVR-2 family 
members, as well as hVR-1, hVR-2, and rVR-2 homologues from other species. The 
probe/primer typically comprises a substantially purified oligonucleotide. The 
oligonucleotide typically comprises a region of nucleotide sequence that hybridizes 
5 under stringent conditions to at least about 12 or 15, preferably about 20 or 25, more 
preferably about 30, 35, 40, 45, 50, 55, 60, 65, 75, or 100 consecutive nucleotides of a 
sense sequence of SEQ ID NO: 1 , 3, 4, 6, 7, 9, 10, or 12, of an anti-sense sequence of 
SEQ ID NO: 1, 3, 4, 6, 7, 9, 10, or 12, or of a naturally occurring allelic variant or mutant 
of SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12. In an exemplary embodiment, a nucleic acid 

10 molecule of the present invention comprises a nucleotide sequence which is greater than 
100-150, 150-200. 200-250, 250-300, 300-350, 350-400, 400-450, 450-500, 500-550, 
550-600, 600-650, 650-700, 700-750, 750-800, 800-850, 850-900, 900-950, 950-1000, 
1088, or more nucleotides in length and hybridizes under stringent hybridization 
conditions to a nucleic acid molecule of SEQ ID NO:l, 3, 4, 6, 7, 9, 10. or 12. 

15 Probes based on the hVR-1, hVR-2, and rVR-2 nucleotide sequences can be used 

to detect transcripts or genomic sequences encoding the same or homologous proteins. 
In preferred embodiments, the probe further comprises a label group attached thereto, 
e.g., the label group can be a radioisotope, a fluorescent compound, an enzyme, or an 
enzyme co-factor. Such probes can be used as a part of a diagnostic test kit for 

20 identifying cells or tissue which misexpress an hVR-1, hVR-2, and rVR-2 protein, such 
as by measuring a level of an hVR-1, hVR-2, and rVR-2-encoding nucleic acid in a 
sample of cells from a subject e.g., detecting hVR-1, hVR-2, and rVR-2 mRNA levels or 
determining whether a genomic hVR-1, hVR-2, and rVR-2 gene has been mutated or 
deleted. 

25 A nucleic acid fragment encoding a "biologically active portion of an hVR-1, 

hVR-2, and rVR-2 protein" can be prepared by isolating a portion of the nucleotide 
sequence of SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12, which encodes a polypeptide having 
an hVR-1, hVR-2, and rVR-2 biological activity (the biological activities of the hVR-1, 
hVR-2, and rVR-2 proteins are described herein), expressing the encoded portion of the 

30 hVR-1, hVR-2, and rVR-2 protein (e.g., by recombinant expression in vitro) and 

assessing the activity of the encoded portion of the hVR-1, hVR-2, and rVR-2 protein. 
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The invention further encompasses nucleic acid molecules that differ from the 
nucleotide sequence shown in SEQ ID NO:K 3, 4, 6, 7, 9, 10, or 12, due to degeneracy 
of the genetic code and thus encode the same hVR-1, hVR-2, and rVR-2 proteins as 
those encoded by the nucleotide sequence shown in SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 
5 12. In another embodiment, an isolated nucleic acid molecule of the invention has a 
nucleotide sequence encoding a protein having an amino acid sequence shown in SEQ 
ID NO:2, 5,8, or 11. 

In addition to the hVR-1, hVR-2, and rVR-2 nucleotide sequences shown in SEQ 
ID NO:l, 3, 4, 6, 7, 9, 10, or 12, it will be appreciated by those skilled in the art that 

10 DNA sequence polymorphisms that lead to changes in the amino acid sequences of the 
hVR-1, hVR-2, and rVR-2 proteins may exist within a population (e.g., the human 
population). Such genetic polymorphism in the hVR-1, hVR-2, and rVR-2 genes may 
exist among individuals within a population due to natural allelic variation. As used 
herein, the terms "gene" and "recombinant gene" refer to nucleic acid molecules which 

15 include an open reading frame encoding an hVR-1, hVR-2, and rVR-2 protein, 

preferably a mammalian hVR-1, hVR-2, and rVR-2 protein, and can further include non- 
coding regulatory sequences, and introns. 

Allelic variants of hVR-1, hVR-2, and rVR-2 include both functional and non- 
functional hVR-1, hVR-2, and rVR-2 proteins. Functional allelic variants are naturally 

20 occurring amino acid sequence variants of the hVR-L hVR-2, and rVR-2 protein that 
maintain the ability to bind an hVR-1, hVR-2, and rVR-2 ligand and/or modulate a pain 
signaling mechanism. Functional allelic variants will typically contain only 
conservative substitution of one or more amino acids of SEQ ID NO:2, 5, 8, or 1 1, or 
substitution, deletion or insertion of non-critical residues in non-critical regions of the 

25 protein. 

Non-functional allelic variants are naturally occurring amino acid sequence 
variants of the hVR-1, hVR-2, and rVR-2 protein that do not have the ability to either 
bind an hVR-1, hVR-2, and rVR-2 ligand and/or modulate a pain signaling mechanism. 
Non-functional allelic variants will typically contain a non-conservative substitution, a 
30 deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID 
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NO:2, 5, 8, or 1 L or a substitution, insertion or deletion in critical residues or critical 
regions. 

The present invention further provides non-human orthologues of the hVR-2 and 
rVR-2 protein. Orthologues of the hVR-2 and rVR-2 protein are proteins that are 
5 isolated from non-human and non-rat organisms and possess the same hVR-2 and rVR- 
2 ligand binding and/or modulation of pain signaling mechanism capabilities of the 
hVR-2 and rVR-2 proteins. Orthologues of the hVR-2 and rVR-2 proteins can readily 
be identified as comprising an amino acid sequence that is substantially homologous to 
SEQ ID NO: 4, 6, 8 or 10. 
10 Moreover, nucleic acid molecules encoding other hVR-1, hVR-2, and rVR-2 

family members and, thus, which have a nucleotide sequence which differs from the 
hVR-1, hVR-2, and rVR-2 sequences of SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12, are 
intended to be within the scope of the invention. For example, another hVR-1, hVR-2, 
and rVR-2 cDNA can be identified based on the nucleotide sequence of hVR-1, hVR-2, 
15 and rVR-2. Moreover, nucleic acid molecules encoding VR-2 proteins from different 
species, and which, thus, have a nucleotide sequence which differs from the hVR-2 and 
rVR-2 sequences of SEQ ID NO:4, 6, 8, or 10 are intended to be within the scope of the 
invention. For example, a mouse hVR-2 cDNA can be identified based on the 
nucleotide sequence of the human VR-2 (hVR-2) or the rat VR-2 (rVR-2). 
20 Nucleic acid molecules corresponding to natural allelic variants and homologues 

of the hVR-1, hVR-2, and rVR-2 cDNAs of the invention can be isolated based on their 
homology to the hVR-1, hVR-2, and rVR-2 nucleic acids disclosed herein using the 
cDNAs disclosed herein, or a portion thereof, as a hybridization probe according to 
standard hybridization techniques under stringent hybridization conditions. Nucleic acid 
25 molecules corresponding to natural allelic variants and homologues of the hVR-1, hVR- 
2, and rVR-2 cDNAs of the invention can further be isolated by mapping to the same 
chromosome or locus as the hVR-1, hVR-2, and rVR-2 gene. 

Accordingly, in another embodiment, an isolated nucleic acid molecule of the 
invention is at least 15, 20, 25, 30 or more nucleotides in length and hybridizes under 
30 stringent conditions to the nucleic acid molecule comprising the nucleotide sequence of 
SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12. In other embodiment, the nucleic acid is at least 
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30, 50. 100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600. 650, 700, 750, 800, 850, 
900, or 950 nucleotides in length. As used herein, the term "hybridizes under stringent 
conditions" is intended to describe conditions for hybridization and washing under 
which nucleotide sequences at least 60% identical to each other typically remain 
5 hybridized to each other. Preferably, the conditions are such that sequences at least 

about 70%, more preferably at least about 80%, even more preferably at least about 85% 
or 90% identical to each other typically remain hybridized to each other. Such stringent 
conditions are known to those skilled in the art and can be found in Current Protocols in 
Molecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. A preferred, non- 
1 0 limiting example of stringent hybridization conditions are hybridization in 6X sodium 
chloride/sodium citrate (SSC) at about 45°C, followed by one or more washes in 0.2 X 
SSC. 0. !% SDS at 50°C, preferably at 55°C, more preferably at 60°C, and even more 
preferably at 65°C. Preferably, an isolated nucleic acid molecule of the invention that 
hybridizes under stringent conditions to the sequence of SEQ ID NO:l, 3, 4, 6, 7, 9, 10, 
15 or 1 2 corresponds to a naturally-occurring nucleic acid molecule. As used herein, a 

"naturally-occurring" nucleic acid molecule refers to an RNA or DNA molecule having 
a nucleotide sequence that occurs in nature (e.g., encodes a natural protein). 

In addition to naturally-occurring allelic variants of the hVR-1, hVR-2, and rVR-2 
sequences that may exist in the population, the skilled artisan will further appreciate that 
20 changes can be introduced by mutation into the nucleotide sequences of SEQ ID NO:l, 3, 
4, 6, 7, 9, 10, or 12, thereby leading to changes in the amino acid sequence of the encoded 
hVR-1, hVR-2, and rVR-2 proteins, without altering the functional ability of the hVR-1, 
hVR-2. and rVR-2 proteins. For example, nucleotide substitutions leading to amino acid 
substitutions at "non-essential" amino acid residues can be made in the sequence of SEQ 
25 ID NO:l, 3, 4, 6, 7, 9, 10, or 12. A "non-essential" amino acid residue is a residue that 
can be altered from the wild-type sequence of hVR-1, hVR-2, and rVR-2 (e.g., the 
sequence of SEQ ID NO:2, 5, 8, or 1 1) without altering the biological activity, whereas 
an "essential" amino acid residue is required for biological activity. For example, amino 
acid residues that are conserved among the hVR-1, hVR-2, and rVR-2 proteins of the 
30 present invention, are predicted to be particularly unamenable to alteration. Furthermore, 
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additional amino acid residues that are conserved between the hVR-1, hVR-2, and rVR-2 
proteins of the present invention and other members of the Capsaicin/Vanilloid receptor 
family are not likely to be amenable to alteration. 

Accordingly, another aspect of the invention pertains to nucleic acid molecules 
5 encoding hVR-1 , hVR-2, and rVR-2 proteins that contain changes in amino acid residues 
that are not essential for activity. Such hVR-1 , hVR-2, and rVR-2 proteins differ in 
amino acid sequence from SEQ ID NO:2, 5, 8, or 1 1, yet retain biological activity, in 
one embodiment, the isolated nucleic acid molecule comprises a nucleotide sequence 
encoding a protein, wherein the protein comprises an amino acid sequence at least about 
10 60%, 65%, 70%, 75%, 80%, 85%, 87%, 90%, 95%, 98% or more homologous to .SEQ 
ID NO:2, 5, 8, or 1L 

An isolated nucleic acid molecule encoding an hVR-1, hVR-2, and rVR-2 protein 
homologous to the protein of SEQ ID NO:2, 5, 8, or 1 1 can be created by introducing one 
or more nucleotide substitutions, additions or deletions into the nucleotide sequence of 
15 SEQ ID NO: K 3, 4, 6, 7, 9, 10, or 12, such that one or more amino acid substitutions, 
additions or deletions are introduced into the encoded protein. Mutations can be 
introduced into SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12, by standard techniques, such as 
site-directed mutagenesis and PCR-mediated mutagenesis. Preferably, conservative 
amino acid substitutions are made at one or more predicted non-essential amino acid 
20 residues. A "conservative amino acid substitution" is one in which the amino acid 

residue is replaced with an amino acid residue having a similar side chain. Families of 
amino acid residues having similar side chains have been defined in the art. These 
families include amino acids with basic side chains (e.g., lysine, arginine, histidine), 
acidic side chains (e.g., aspartic acid, glutamic acid), uncharged polar side chains (e.g., 
25 glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine), nonpolar side chains 
(e.g., alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan), 
beta-branched side chains (e.g., threonine, valine, isoleucine) and aromatic side chains 
O-g., tyrosine, phenylalanine, tryptophan, histidine). Thus, a predicted nonessential 
amino acid residue in an hVR-1 , hVR-2, and rVR-2 protein is preferably replaced with 
30 another amino acid residue from the same side chain family. Alternatively, in another 
embodiment, mutations can be introduced randomly along all or part of an hVR-1, hVR- 
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2, and rVR-2 coding sequence, such as by saturation mutagenesis, and the resultant 
mutants can be screened for hVR-1, hVR-2 ? and rVR-2 biological activity to identify 
mutants that retain activity. Following mutagenesis of SEQ ID NO: 1, 3, 4, 6, 7, 9, 10, or 
12. 

5 In a embodiment, a mutant hVR-1 , hVR-2, and rVR-2 protein can be assayed for 

the ability to (1) interact with a non-hVR-1, non-hVR-2, or non- rVR-2 protein molecule, 
e.g., a vanilloid compound such as capsaicin; (2) modulate intracellular calcium 
concentration; (3) activate an hVR-1, hVR-2, and rVR-2-dependent signal transduction 
pathway; or (4) modulate a pain signaling mechanism. 
10 In addition to the nucleic acid molecules encoding hVR-1, hVR-2, and rVR-2 

proteins described above, another aspect of the invention pertains to isolated nucleic acid 
molecules which are antisense thereto. An "antisense" nucleic acid comprises a 
nucleotide sequence which is complementary to a "sense" nucleic acid encoding a 
protein, e.g., complementary to the coding strand of a double-stranded cDNA molecule or 
1 5 complementary to an mRNA sequence. Accordingly, an antisense nucleic acid can 

hydrogen bond to a sense nucleic acid. The antisense nucleic acid can be complementary 
to an entire hVR-K hVR-2, and rVR-2 coding strand, or to only a portion thereof In one 
embodiment, an antisense nucleic acid molecule is antisense to a "coding region" of the 
coding strand of a nucleotide sequence encoding hVR-1, hVR-2, and rVR-2. The term 

20 "coding region" refers to the region of the nucleotide sequence comprising codons which 
are translated into amino acid residues (e.g., the coding region of hVR-1, hVR-2, and 
rVR-2). In another embodiment, the antisense nucleic acid molecule is antisense to a 
"noncoding region" of the coding strand of a nucleotide sequence encoding hVR-1, hVR- 
2, and rVR-2. The term "noncoding region" refers to 5* and 3' sequences which flank the 

25 coding region that are not translated into amino acids (i.e., also referred to as 5 r and 3' 
untranslated regions). 

Given the coding strand sequences encoding hVR-1, hVR-2, and rVR-2 disclosed 
herein, antisense nucleic acids of the invention can be designed according to the rules of 
Watson and Crick base pairing. The antisense nucleic acid molecule can be 

30 complementary to the entire coding region of hVR-1, hVR-2, and rVR-2 mRNA, but 
more preferably is an oligonucleotide which is antisense to only a portion of the coding 
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or noncoding region of hVR-1, hVR-2, and rVR-2 mRNA. For example, the antisense 
oligonucleotide can be complementary to the region surrounding the translation start site 
of hVR-K hVR-2, and rVR-2 mRNA. An antisense oligonucleotide can be, for example, 
about 5, 10, 15, 20, 25, 30, 35, 40, 45 or 50 nucleotides in length. An antisense nucleic 
5 acid of the invention can be constructed using chemical synthesis and enzymatic ligation 
reactions using procedures known in the art. For example, an antisense nucleic acid (e.g., 
an antisense oligonucleotide) can be chemically synthesized using naturally occurring 
nucleotides or variously modified nucleotides designed to increase the biological stability 
of the molecules or to increase the physical stability of the duplex formed between the 
10 antisense and sense nucleic acids, e.g., phosphorothioate derivatives and acridine 

substituted nucleotides can be used. Examples of modified nucleotides which can be 
used to generate the antisense nucleic acid include 5-fluorouracil, 5-bromouracil, 5- 
chlorouracil, 5-iodouracil, hypoxanthine, xantine, 4-acetyIcytosine, 5- 
(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyI-2-thiouridine, 5- 

15 carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, 
N6-isopentenyladenine, 1-methylguanine, 1 -methy linosine, 2,2-dimethylguanine, 2- 
methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7- 
methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta- 
D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio- 

20 N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, 

queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouraciI, 4-thiouracil, 5-methyluracil, 
uracil-5- oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 3- 
(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. Alternatively, 
the antisense nucleic acid can be produced biologically using an expression vector into 

25 which a nucleic acid has been subcloned in an antisense orientation (i.e., RNA 

transcribed from the inserted nucleic acid will be of an antisense orientation to a target 
nucleic acid of interest, described further in the following subsection). 

The antisense nucleic acid molecules of the invention are typically administered 
to a subject or generated in situ such that they hybridize with or bind to cellular mRNA 

30 and/or genomic DNA encoding an hVR-1 , hVR-2, and rVR-2 protein to thereby inhibit 
expression of the protein, e.g. , by inhibiting transcription and/or translation. The 
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hybridization can be by conventional nucleotide complementarity to form a stable 
duplex, on, for example, in the case of an antisense nucleic acid molecule which binds to 
DNA duplexes, through specific interactions in the major groove of the double helix. 
An example of a route of administration of antisense nucleic acid molecules of the 
5 invention include direct injection at a tissue site. Alternatively, antisense nucleic acid 
molecules can be modified to target selected cells and then administered systemically. 
For example, for systemic administration, antisense molecules can be modified such that 
they specifically bind to receptors or antigens expressed on a selected cell surface, e.g., 
by linking the antisense nucleic acid molecules to peptides or antibodies which bind to 
10 cell surface receptors or antigens. The antisense nucleic acid molecules can also be 

delivered to cells using the vectors described herein. To achieve sufficient intracellular 
concentrations of the antisense molecules, vector constructs in which the antisense 
nucleic acid molecule is placed under the control of a strong pol II or pol III promoter 
are preferred. 

1 5 In yet another embodiment, the antisense nucleic acid molecule of the invention 

is an -anomeric nucleic acid molecule. An a-anomeric nucleic acid molecule forms 
specific double-stranded hybrids with complementary RNA in which, contrary to the 
usual -units, the strands run parallel to each other (Gaultier et ah (1987) Nucleic Acids. 
Res. 15:6625-6641). The antisense nucleic acid molecule can also comprise a 2 f -o- 

20 methylribonucleotide (Inoue et ah (1987) Nucleic Acids Res. 15:6131-6148) or a 
chimeric RNA-DNA analogue (Inoue et ah (1987) FEBS Lett. 215:327-330). 

In still another embodiment, an antisense nucleic acid of the invention is a 
ribozyme. Ribozymes are catalytic RNA molecules with ribonuclease activity which are 
capable of cleaving a single-stranded nucleic acid, such as an mRNA, to which they 

25 have a complementary region. Thus, ribozymes {e.g., hammerhead ribozymes 
(described in Haselhoff and Gerlach (1988) Nature 334:585-591)) can be used to 
catalytically cleave hVR-1, hVR-2, and rVR-2 mRNA transcripts to thereby inhibit 
translation of hVR-1, hVR-2, and rVR-2 mRNA. A ribozyme having specificity for an 
hVR-1, hVR-2, and rVR-2-encoding nucleic acid can be designed based upon the 

30 nucleotide sequence of an hVR-1, hVR-2, and rVR-2 cDNA disclosed herein {i.e., SEQ 
ID NO:l, 3, 4, 6, 7, 9, 10, or 12). For example, a derivative of a Tetrahymena L-19 IVS 
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RNA can be constructed in which the nucleotide sequence of the active site is 
complementary to the nucleotide sequence to be cleaved in an hVR-1, hVR-2, and rVR- 
2-encoding mRNA. See, e.g., Cech et al U.S. Patent No. 4,987,071; and Cech et al 
U.S. Patent No. 5,1 16,742. Alternatively. hVR-1, hVR-2, and rVR-2 mRNA can be 
5 used to select a catalytic RNA having a specific ribonuclease activity from a pool of 
RNA molecules. See. for example, Bartel, D. and Szostak, J.W. (1993) Science 
261:1411-1418. 

Alternatively, hVR-1, hVR-2, and rVR-2 gene expression can be inhibited by 
targeting nucleotide sequences complementary to the regulatory region of the hVR-1, 
1 0 hVR-2. and rVR-2 (e.g. , the hVR- 1 , hVR-2, and rVR-2 promoter and/or enhancers) to 
form triple helical structures that prevent transcription of the hVR-1, hVR-2, and rVR-2 
gene in target cells. See generally, Helene, C. (\99\) Anticancer Drug Des. 6(6):569- 
84; Helene, C et al (1992) Ann. N. Y. Acad Sci. 660:27-36; and Maher, LJ. (1992) 
Bioassays 14(1 2): 807- 15. 

15 In yet another embodiment, the hVR-1, hVR-2, and rVR-2 nucleic acid 

molecules of the present invention can be modified at the base moiety, sugar moiety or 
phosphate backbone to improve, e.g., the stability, hybridization, or solubility of the 
molecule. For example, the deoxyribose phosphate backbone of the nucleic acid 
molecules can be modified to generate peptide nucleic acids (see Hyrup B. et al. (1996) 

20 Bioorganic & Medicinal Chemistry 4(1): 5-23). As used herein, the terms "peptide 
nucleic acids" or "PNAs" refer to nucleic acid mimics, e.g., DNA mimics, in which the 
deoxyribose phosphate backbone is replaced by a pseudopeptide backbone and only the 
four natural nucleobases are retained. The neutral backbone of PNAs has been shown to 
allow for specific hybridization to DNA and RNA under conditions of low ionic 

25 strength. The synthesis of PNA oligomers can be performed using standard solid phase 
peptide synthesis protocols as described in Hyrup B. et al (1996) supra; Perry-O'Keefe 
etal Proc. Natl. Acad. Sci. 93: 14670-675. 

PNAs of hVR-1 , hVR-2, and rVR-2 nucleic acid molecules can be used in 
therapeutic and diagnostic applications. For example, PNAs can be used as antisense or 

30 antigene agents for sequence-specific modulation of gene expression by, for example, 
inducing transcription or translation arrest or inhibiting replication. PNAs of hVR-1, 
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hVR-2, and rVR-2 nucleic acid molecules can also be used in the analysis of single base 
pair mutations in a gene, (e.g., by PNA-directed PGR clamping); as 'artificial restriction 
enzymes' when used in combination with other enzymes, (e.g., SI nucleases (Hyrup B. 
(1 996) supra)): or as probes or primers for DNA sequencing or hybridization (Hyrup B. 
5 et al. (1996) supra; Perry-O'Keefe supra). 

In another embodiment, PNAs of hVR-1, hVR-2, and rVR-2 can be modified, 
(e.g., to enhance their stability or cellular uptake), by attaching lipophilic or other helper 
groups to PNA, by the formation of PNA-DNA chimeras, or by the use of liposomes or 
other techniques of drug delivery known in the art. For example, PNA-DNA chimeras 
10 of hVR-1 , hVR-2, and rVR-2 nucleic acid molecules can be generated which may 
combine the advantageous properties of PNA and DNA. Such chimeras allow DNA 
recognition enzymes, (e.g., RNAse H and DNA polymerases), to interact with the DNA 
portion while the PNA portion would provide high binding affinity and specificity. 
PNA-DNA chimeras can be linked using linkers of appropriate lengths selected in terms 
15 of base stacking, number of bonds between the nucleobases, and orientation (Hyrup B. 
(1996) supra). The synthesis of PNA-DNA chimeras can be performed as described in 
Hyrup B. (1996) supra and Finn P.J. et al (1996) Nucleic Acids Res. 24 (17): 3357-63. 
For example, a DNA chain can be synthesized on a solid support using standard 
phosphoramidite coupling chemistry and modified nucleoside analogs, e.g., 5*-(4- 
20 methoxytrityl)amino-5'-deoxy-thymidine phosphoramidite, can be used as a between the 
PNA and the 5* end of DNA (Mag, M. et ai (1989) Nucleic Acid Res. 17: 5973-88). 
PNA monomers are then coupled in a stepwise manner to produce a chimeric molecule 
with a 5' PNA segment and a 3' DNA segment (Finn P.J. et al. (1996) supra). 
Alternatively, chimeric molecules can be synthesized with a 5' DNA segment and a 3' 
25 PNA segment (Peterser, K.H. et ai (1975) Bioorganic Med. Chem. Lett. 5:1119-11 124). 

In other embodiments, the oligonucleotide may include other appended groups 
such as peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating 
transport across the cell membrane (see, e.g., Letsinger et al. (1989) Proc. Natl. Acad. 
Sci. USA 86:6553-6556; Lemaitre et al. (1987) Proc. Natl. Acad. Sci. USA 84:648-652; 
30 PCT Publication No. W088/098 1 0) or the blood-brain barrier (see, e.g. , PCT Publication 
No. W089/10134). In addition, oligonucleotides can be modified with hybridization- 
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triggered cleavage agents (See, e.g., Krol et ai (1988) Bio-Techniques 6:958-976) or 
intercalating agents. (See, e.g., Zon (1988) Pharm. Res. 5:539-549). To this end, the 
oligonucleotide may be conjugated to another molecule, (e.g., a peptide, hybridization 
triggered cross-linking agent, transport agent, or hybridization-triggered cleavage agent). 

5 

II. Isolated hVR-L hVR-2, and rVR-2 Proteins and Anti-hVR-K Anti-hVR-2. and Anti- 
rVR-2 Antibodies 

One aspect of the invention pertains to isolated hVR-1, hVR-2, and rVR-2 
proteins, and biologically active portions thereof, as well as polypeptide fragments 
10 suitable for use as immunogens to raise anti-hVR-2, anti-hVR-2, and anti-rVR-2 
antibodies. In one embodiment, native hVR-1, hVR-2, and rVR-2 proteins can be 
isolated from cells or tissue sources by an appropriate purification scheme using 
standard protein purification techniques. In another embodiment, hVR-1, hVR-2, and 
rVR-2 proteins are produced by recombinant DNA techniques. Alternative to 
1 5 recombinant expression, an hVR-1 , hVR-2, and rVR-2 protein or polypeptide can be 
synthesized chemically using standard peptide synthesis techniques. 

An "isolated" or "purified" protein or biologically active portion thereof is 
substantially free of cellular material or other contaminating proteins from the cell or 
tissue source from which the hVR-1, hVR-2, and rVR-2 protein is derived, or 

20 substantially free from chemical precursors or other chemicals when chemically 

synthesized. The language "substantially free of cellular material" includes preparations 
of hVR-1, hVR-2, and rVR-2 protein in which the protein is separated from cellular 
components of the cells from which it is isolated or recombinantly produced. In one 
embodiment, the language "substantially free of cellular material" includes preparations 

25 of hVR-1, hVR-2, and rVR-2 protein having less than about 30% (by dry weight) of 
non-hVR-1, hVR-2, and rVR-2 protein (also referred to herein as a "contaminating 
protein"), more preferably less than about 20% of non-hVR-1, hVR-2, and rVR-2 
protein, still more preferably less than about 10% of non-hVR-1, hVR-2, and rVR-2 
protein, and most preferably less than about 5% non-hVR-1, non-hVR-2, and non-rVR-2 

30 protein. When the hVR- 1 , hVR-2, and rVR-2 protein or biologically active portion 
thereof is recombinantly produced, it is also preferably substantially free of culture 
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medium, i.e., culture medium represents less than about 20%, more preferably less than 
about 10%, and most preferably less than about 5% of the volume of the protein 
preparation. 

The language "substantially free of chemical precursors or other chemicals" 
5 includes preparations of hVR-L hVR-2, and rVR-2 protein in which the protein is 
separated from chemical precursors or other chemicals which are involved in the 
synthesis of the protein. In one embodiment, the language "substantially free of 
chemical precursors or other chemicals" includes preparations of hVR-1, hVR-2, and 
rVR-2 protein having less than about 30% (by dry weight) of chemical precursors or 
10 non-hVR-1, hVR-2, and rVR-2 chemicals, more preferably less than about 20% 

chemical precursors or non-hVR-1, hVR-2, and rVR-2 chemicals, still more preferably 
less than about 10% chemical precursors or non-hVR-1. hVR-2, and rVR-2 chemicals, 
and most preferably less than about 5% chemical precursors or non-hVR-1, hVR-2, and 
rVR-2 chemicals. 

15 As used herein, a "biologically active portion" of an hVR-1, hVR-2, and rVR-2 

protein includes a fragment of an hVR-1, hVR-2, and rVR-2 protein which participates 
in an interaction between an hVR-1, hVR-2, and rVR-2 molecule and a non-hVR-1, 
non-hVR-2, and non-rVR-2 molecule, respectively. Biologically active portions of an 
hVR-1, hVR-2. and rVR-2 protein include peptides comprising amino acid sequences 

20 sufficiently homologous to or derived from the amino acid sequence of the hVR-1 , hVR- 
2, and rVR-2 protein, e.g., the amino acid sequence shown in SEQ ID NO:2, 5, 8, or 1 1, 
which include less amino acids than the full length hVR-1, hVR-2, and rVR-2 proteins, 
and exhibit at least one activity of an hVR-1, hVR-2, and rVR-2 protein. Typically, 
biologically active portions comprise a domain or motif with at least one activity of the 

25 hVR-1, hVR-2, and rVR-2 protein, e.g., binding of an hVR-1, hVR-2, and rVR-2 ligand 
such as a vanilloid compound, e.g., Capsaicin. A biologically active portion of an hVR- 
1, hVR-2, and rVR-2 protein can be a polypeptide which is, for example, 10, 20, 30, 40, 
50, 60, 70, 80, 90, 100, 200 or more amino acids in length. Biologically active portions 
of an hVR-1, hVR-2, and rVR-2 protein can be used as targets for developing agents 

30 which modulate an hVR-1 , hVR-2, and rVR-2 mediated activity, e.g., a pain signaling 
mechanism. 
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In one embodiment, a biologically active portion of an hVR-1, hVR-2, and rVR- 
2 protein comprises at least one transmembrane domain, and/or at least one proline rich 
domain, and/or at least one ankyrin repeat domain. It is to be understood that a 
biologically active portion of an hVR-1, hVR-2, and rVR-2 protein of the present 
5 invention may contain at least one of the above-identified structural domains. A more 
biologically active portion of an hVR-1, hVR-2, and rVR-2 protein may contain at least 
two of the above-identified structural domains. Moreover, other biologically active 
portions, in which other regions of the protein are deleted, can be prepared by 
recombinant techniques and evaluated for one or more of the functional activities of a 

1 0 native h VR- 1 , hVR-2, and rVR-2 protein. 

In a embodiment, the hVR-1, hVR-2. and rVR-2 protein has an amino acid 
sequence shown in SEQ ID NO:2, 5, 8, or 1 1. In other embodiments, the hVR-1, hVR- 
2, and rVR-2 protein is substantially homologous to SEQ ID NO:2, 5, 8, or 1 1, and 
retains the functional activity of the protein of SEQ ID NO:2, 5, 8, or 1 L yet differs in 

1 5 amino acid sequence due to natural allelic variation or mutagenesis, as described in 
detail in subsection I above. Accordingly, in another embodiment, the hVR-1, hVR-2, 
and rVR-2 protein is a protein which comprises an amino acid sequence at least about 
60%, 65%, 70%, 75%, 80%, 85%, 87%, 90%, 95%, 98% or more homologous to SEQ 
ID NO:2, 5, 8, or 11. 

20 To determine the percent identity of two amino acid sequences or of two nucleic 

acid sequences, the sequences are aligned for optimal comparison purposes (e.g., gaps 
can be introduced in one or both of a first and a second amino acid or nucleic acid 
sequence for optimal alignment and non-homologous sequences can be disregarded for 
comparison purposes). In a embodiment, the length of a reference sequence aligned for 

25 comparison purposes is at least 30%, preferably at least 40%, more preferably at least 
50%, even more preferably at least 60%, and even more preferably at least 70%, 80%, or 
90% of the length of the reference sequence (e.g., when aligning a second sequence to 
the hVR-1, hVR-2, and rVR-2 amino acid sequence of SEQ ID NO:2 ? 5, 8, or 1 1, having 
177 amino acid residues, at least 80, preferably at least 100, more preferably at least 120, 

30 even more preferably at least 140, and even more preferably at least 150, 160 or 170 
amino acid residues are aligned). The amino acid residues or nucleotides at 
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corresponding amino acid positions or nucleotide positions are then compared. When a 
position in the first sequence is occupied by the same amino acid residue or nucleotide 
as the corresponding position in the second sequence, then the molecules are identical at 
that position (as used herein amino acid or nucleic acid "identity" is equivalent to amino 
5 acid or nucleic acid "homology"). The percent identity between the two sequences is a 
function of the number of identical positions shared by the sequences, taking into 
account the number of gaps, and the length of each gap, which need to be introduced for 
optimal alignment of the two sequences. 

The comparison of sequences and determination of percent identity between two 
10 sequences can be accomplished using a mathematical algorithm. In a embodiment, the 
percent identity between two amino acid sequences is determined using the Needleman 
and Wunsch (./. Mol Biol (48):444-453 (1970)) algorithm which has been incorporated 
into the GAP program in the GCG software package (available at http://www.gcg.com), 
using either a Blossum 62 matrix or a PAM250 matrix, and a gap weight of 16, 14, 12, 
15 10, 8, 6, or 4 and a length weight of 1, 2, 3, 4, 5, or 6. In yet another embodiment, the 
percent identity between two nucleotide sequences is determined using the GAP 
program in the GCG software package (available at http://www.gcg.com), using a 
NWSgapdna.CMP matrix and a gap weight of 40, 50, 60, 70, or 80 and a length weight 
of 1, 2, 3, 4, 5, or 6. In another embodiment, the percent identity between two amino 
20 acid or nucleotide sequences is determined using the algorithm of E. Meyers and W. 

Miller (CABIOS, 4:1 1-17 (1989)) which has been incorporated into the ALIGN program 
(version 2.0), using a PAM120 weight residue table, a gap length penalty of 12 and a 
gap penalty of 4. 

The nucleic acid and protein sequences of the present invention can further be 
25 used as a "query sequence" to perform a search against public databases to, for example, 
identify other family members or related sequences. Such searches can be performed 
using the NBLAST and XBLAST programs (version 2.0) of Altschul, et al (1990) J. 
Mol Biol 215:403-10. BLAST nucleotide searches can be performed with the 
NBLAST program, score = 100, wordlength = 12 to obtain nucleotide sequences 
30 homologous to hVR-1, hVR-2, and rVR-2 nucleic acid molecules of the invention. 
BLAST protein searches can be performed with the XBLAST program, score = 50, 
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wordlength = 3 to obtain amino acid sequences homologous to hVR-1, hVR-2, and rVR- 
2 protein molecules of the invention. To obtain gapped alignments for comparison 
purposes. Gapped BLAST can be utilized as described in Altschul el al. r (1997) Nucleic 
Acids Res. 25(17):3389-3402. When utilizing BLAST and Gapped BLAST programs, 
5 the default parameters of the respective programs {e.g., XBLAST and NBLAST) can be 
used. See http://www.ncbi.nlm.nih.gov. 

The invention also provides hVR-1, hVR-2, and rVR-2 chimeric or fusion 
proteins. As used herein, an hVR-1, hVR-2, and rVR-2 "chimeric protein" or "fusion 
protein" comprises an hVR-1, hVR-2, and rVR-2 polypeptide operatively linked to a 
1 0 non-hVR- 1 , h VR-2, and rVR-2 polypeptide. An "h VR- 1 , hVR-2, and rVR-2 

polypeptide" refers to a polypeptide having an amino acid sequence corresponding to 
hVR-1. hVR-2, and rVR-2, whereas a "non-hVR-l, non-hVR-2, and non-rVR-2 
polypeptide" refers to a polypeptide having an amino acid sequence corresponding to a 
protein which is not substantially homologous to the hVR-1, hVR-2, and rVR-2 protein, 
1 5 e.g., a protein which is different from the hVR-1, hVR-2, and rVR-2 protein and which 
is derived from the same or a different organism. Within an hVR-1, hVR-2, and rVR-2 
fusion protein the hVR-1, hVR-2, and rVR-2 polypeptide can correspond to all or a 
portion of an hVR-1, hVR-2, and rVR-2 protein. In a embodiment, an hVR-1, hVR-2, 
and rVR-2 fusion protein comprises at least one biologically active portion of an hVR-1, 

20 hVR-2, and rVR-2 protein. In another embodiment, an hVR-1, hVR-2, and rVR-2 

fusion protein comprises at least two biologically active portions of an hVR-1, hVR-2, 
and rVR-2 protein. Within the fusion protein, the term "operatively linked" is intended 
to indicate that the hVR-1, hVR-2, and rVR-2 polypeptide and the non-hVR-l, non- 
hVR-2, and non-rVR-2 polypeptide are fused in-frame to each other. The non-hVR-l, 

25 h VR-2, and rVR-2 polypeptide can be fused to the N-terminus or C-terminus of the 
hVR-1, hVR-2, and rVR-2 polypeptide. 

For example, in one embodiment, the fusion protein is a GST-hVR-1, GST- 
hVR-2, and GST-rVR-2 fusion protein in which the hVR-1, hVR-2, and rVR-2 
sequences are fused to the C-terminus of the GST sequences. Such fusion proteins can 

30 facilitate the purification of recombinant hVR-1, hVR-2, and rVR-2. 



BNSDOCID: <WO 0029S77A1 IA> 



WO 00/29577 PCT/US99/2670I 

- 34 - 

In another embodiment, the fusion protein is an hVR-1, hVR-2, and rVR-2 
protein containing a heterologous signal sequence at its N-terminus. In certain host cells 
(e.g., mammalian host cells), expression and/or secretion of hVR-1 , hVR-2, and rVR-2 
can be increased through use of a heterologous signal sequence. 
5 The hVR-1 , hVR-2, and rVR-2 fusion proteins of the invention can be 

incorporated into pharmaceutical compositions and administered to a subject in vivo. 
The hVR-1, hVR-2, and rVR-2 fusion proteins can be used to affect the bioavailability 
of an hVR-1, hVR-2, and rVR-2 substrate. Use of hVR-l ? hVR-2, and rVR-2 fusion 
proteins may be useful therapeutically for the treatment of disorders caused by, for 
10 example, (i) aberrant modification or mutation of a gene encoding an hVR-1, hVR-2, 
and rVR-2 protein; (ii) mis-regulation of the hVR-L hVR-2, and rVR-2 gene; and (iii) 
aberrant post-translational modification of an hVR-1, hVR-2, and rVR-2 protein. 

Moreover, the hVR-1, hVR-2, and rVR-2-fusion proteins of the invention can be 
used as immunogens to produce anti-hVR-1. anti-hVR-2, and anti-rVR-2 antibodies in a 
15 subject, to purify hVR-1, hVR-2, and rVR-2 ligands and in screening assays to identify 
molecules which inhibit the interaction of hVR-1, hVR-2, and rVR-2 with an hVR-1, 
hVR-2 ; and rVR-2 substrate. 

Preferably, an hVR-1, hVR-2, and rVR-2 chimeric or fusion protein of the 
invention is produced by standard recombinant DNA techniques. For example, DNA 
20 fragments coding for the different polypeptide sequences are ligated together in-frame in 
accordance with conventional techniques, for example by employing blunt-ended or 
stagger-ended termini for ligation, restriction enzyme digestion to provide for 
appropriate termini, filling-in of cohesive ends as appropriate, alkaline phosphatase 
treatment to avoid undesirable joining, and enzymatic ligation. In another embodiment, 
25 the fusion gene can be synthesized by conventional techniques including automated 
DNA synthesizers. Alternatively, PCR amplification of gene fragments can be carried 
out using anchor primers which give rise to complementary overhangs between two 
consecutive gene fragments which can subsequently be annealed and reamplified to 
generate a chimeric gene sequence (see, for example, Current Protocols in Molecular 
30 Biology, eds. Ausubel et al John Wiley & Sons: 1992). Moreover, many expression 
vectors are commercially available that already encode a fusion moiety (e.g., a GST 
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polypeptide). An hVR-K hVR-2, and rVR-2-encoding nucleic acid can be cloned into 
such an expression vector such that the fusion moiety is linked in-frame to the hVR-1, 
hVR-2, and rVR-2 protein. 

The present invention also pertains to variants of the hVR-1, hVR-2, and rVR-2 
5 proteins which function as either hVR- 1 , hVR-2, and rVR-2 agonists (mimetics) or as 
hVR-1, hVR-2, and rVR-2 antagonists. Variants of the hVR-1, hVR-2, and rVR-2 
proteins can be generated by mutagenesis, e.g., discrete point mutation or truncation of 
an hVR- 1 , h VR-2, and rVR-2 protein. An agonist of the h VR- 1 , h VR-2, and rVR-2 
proteins can retain substantially the same, or a subset, of the biological activities of the 

10 naturally occurring form of an hVR-L hVR-2, and rVR-2 protein. An antagonist of an 
hVR-1, hVR-2, and rVR-2 protein can inhibit one or more of the activities of the 
naturally occurring form of the hVR-K hVR-2, and rVR-2 protein by, for example, 
competitively modulating an hVR-1, hVR-2, and rVR-2-mediated activity of an hVR-1, 
hVR-2, and rVR-2 protein. Thus, specific biological effects can be elicited by treatment 

1 5 with a variant of limited function. In one embodiment, treatment of a subject with a 
variant having a subset of the biological activities of the naturally occurring form of the 
protein has fewer side effects in a subject relative to treatment with the naturally 
occurring form of the hVR-1, hVR-2, and rVR-2 protein. 

In one embodiment, variants of an hVR-1, hVR-2, and rVR-2 protein which 

20 function as either hVR-1 , hVR-2, and rVR-2 agonists (mimetics) or as hVR-1, hVR-2, 
and rVR-2 antagonists can be identified by screening combinatorial libraries of mutants, 
e.g., truncation mutants, of an hVR-1, hVR-2, and rVR-2 protein for hVR-1, hVR-2, and 
rVR-2 protein agonist or antagonist activity. In one embodiment, a variegated library of 
hVR-1, hVR-2, and rVR-2 variants is generated by combinatorial mutagenesis at the 

25 nucleic acid level and is encoded by a variegated gene library. A variegated library of 
hVR-1, hVR-2, and rVR-2 variants can be produced by, for example, enzymatically 
ligating a mixture of synthetic oligonucleotides into gene sequences such that a 
degenerate set of potential hVR-1, hVR-2, and rVR-2 sequences is expressible as 
individual polypeptides, or alternatively, as a set of larger fusion proteins (e.g., for phage 

30 display) containing the set of hVR-1, hVR-2, and rVR-2 sequences therein. There are a 
variety of methods which can be used to produce libraries of potential hVR-1, hVR-2, 
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and rVR-2 variants from a degenerate oligonucleotide sequence. Chemical synthesis of 
a degenerate gene sequence can be performed in an automatic DNA synthesizer, and the 
synthetic gene then ligated into an appropriate expression vector Use of a degenerate 
set of genes allows for the provision, in one mixture, of all of the sequences encoding 
5 the desired set of potential hVR-1, hVR-2, and rVR-2 sequences. Methods for 

synthesizing degenerate oligonucleotides are known in the art (see, e.g., Narang, S.A. 
(1983) Tetrahedron 39:3; Itakura et al (1984) Annu. Rev. Biochem. 53:323; Itakura et 
al (1984) Science 198:1056; Ike et al (1983) Nucleic Acid Res. 11:477. 

In addition, libraries of fragments of an hVR-1 , hVR-2, and rVR-2 protein 
10 coding sequence can be used to generate a variegated population of hVR-1, hVR-2, and 
rVR-2 fragments for screening and subsequent selection of variants of an hVR-1, hVR- 
2, and rVR-2 protein. In one embodiment, a library of coding sequence fragments can 
be generated by treating a double stranded PCR fragment of an hVR-1, hVR-2, and 
rVR-2 coding sequence with a nuclease under conditions wherein nicking occurs only 
1 5 about once per molecule, denaturing the double stranded DNA, renaturing the DNA to 
form double stranded DNA which can include sense/antisense pairs from different 
nicked products, removing single stranded portions from reformed duplexes by 
treatment with SI nuclease, and ligating the resulting fragment library into an expression 
vector. By this method, an expression library can be derived which encodes N-terminal, 
20 C-terminal and internal fragments of various sizes of the hVR-L hVR-2, and rVR-2 
protein. 

Several techniques are known in the art for screening gene products of 
combinatorial libraries made by point mutations or truncation, and for screening cDNA 
libraries for gene products having a selected property. Such techniques are adaptable for 

25 rapid screening of the gene libraries generated by the combinatorial mutagenesis of 
hVR-1, hVR-2, and rVR-2 proteins. The most widely used techniques, which are 
amenable to high through-put analysis, for screening large gene libraries typically 
include cloning the gene library into replicable expression vectors, transforming 
appropriate cells with the resulting library of vectors, and expressing the combinatorial 

30 genes under conditions in which detection of a desired activity facilitates isolation of the 
vector encoding the gene whose product was detected. Recrusive ensemble mutagenesis 
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(REM), a new technique which enhances the frequency of functional mutants in the 
libraries, can be used in combination with the screening assays to identify hVR-L hVR- 
2, and rVR-2 variants (Arkin and Yourvan (1992) Proc. Nail. Acad. Sci. USA 59:781 1- 
7815; Delgrave et ai (1993) Protein Engineering 6(3):327-331). 
5 In one embodiment, cell based assays can be exploited to analyze a variegated 

hVR-L hVR-2, and rVR-2 library. For example, a library of expression vectors can be 
transfected into a cell line, e.g., a neuronal cell line, which ordinarily responds to a 
particular ligand in an hVR-1, hVR-2, and rVR-2-dependent manner. The transfected 
cells are then contacted with the ligand and the effect of expression of the mutant on 

10 signaling by the ligand can be detected, e.g., by measuring intracellular calcium 

concentration, neuronal membrane depolarization, or the activity of an hVR-1, hVR-2, 
and rVR-2-regulated transcription factor. Plasmid DNA can then be recovered from the 
cells which score for inhibition, or alternatively, potentiation of signaling by the ligand, 
and the individual clones further characterized. 

15 An isolated hVR-1, hVR-2, and rVR-2 protein, or a portion or fragment thereof, 

can be used as an immunogen to generate antibodies that bind hVR-1, hVR-2, and rVR- 
2 using standard techniques for polyclonal and monoclonal antibody preparation. A 
full-length hVR-1, hVR-2, and rVR-2 protein can be used or, alternatively, the invention 
provides antigenic peptide fragments of hVR-1, hVR-2, and rVR-2 for use as 

20 immunogens. The antigenic peptide of hVR-1, hVR-2, and rVR-2 comprises at least 8 
amino acid residues of the amino acid sequence shown in SEQ ID NO:2, 5, 8, or 1 1 and 
encompasses an epitope of hVR-l ; hVR-2, and rVR-2 such that an antibody raised 
against the peptide forms a specific immune complex with hVR-1, hVR-2, and rVR-2. 
Preferably, the antigenic peptide comprises at least 10 amino acid residues, more 

25 preferably at least 1 5 amino acid residues, even more preferably at least 20 amino acid 
residues, and most preferably at least 30 amino acid residues. 

Epitopes encompassed by the antigenic peptide are regions of hVR-1, hVR-2, 
and rVR-2 that are located on the surface of the protein, e.g., hydrophilic regions, as 
well as regions with high antigenicity (see, for example, Figures 12 and 14). 
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An hVR-1, hVR-2, and rVR-2 immunogen typically is used to prepare antibodies 
by immunizing a suitable subject, {e.g., rabbit, goat, mouse or other mammal) with the 
immunogen. An appropriate immunogenic preparation can contain, for example, 
recombinantly expressed hVR-1, hVR-2, and rVR-2 protein or a chemically synthesized 
5 hVR-1, hVR-2. and rVR-2 polypeptide. The preparation can further include an 
adjuvant, such as Freund's complete or incomplete adjuvant, or similar 
immunostimulatory agent. Immunization of a suitable subject with an immunogenic 
hVR-1, hVR-2, and rVR-2 preparation induces a polyclonal anti-hVR-1, anti-hVR-2, 
and anti-rVR-2 antibody response. 
1 0 Accordingly, another aspect of the invention pertains to anti-hVR-1 , anti-hVR-2, 

and anti-rVR-2 antibodies. The term "antibody" as used herein refers to 
immunoglobulin molecules and immunologically active portions of immunoglobulin 
molecules, i.e., molecules that contain an antigen binding site which specifically binds 
(immunoreacts with) an antigen, such as hVR-1, hVR-2, and rVR-2. Examples of 
1 5 immunologically active portions of immunoglobulin molecules include F(ab) and 

F(ab')2 fragments which can be generated by treating the antibody with an enzyme such 
as pepsin. The invention provides polyclonal and monoclonal antibodies that bind hVR- 
1, hVR-2, and rVR-2. The term "monoclonal antibody" or "monoclonal antibody 
composition", as used herein, refers to a population of antibody molecules that contain 
20 only one species of an antigen binding site capable of immunoreacting with a particular 
epitope of hVR-1 , hVR-2, and rVR-2. A monoclonal antibody composition thus 
typically displays a single binding affinity for a particular hVR-1, hVR-2, and rVR-2 
protein with which it immunoreacts. 

Polyclonal anti-hVR-1, anti-hVR-2, and anti-rVR-2 antibodies can be prepared 
25 as described above by immunizing a suitable subject with an hVR-1 , hVR-2, and rVR-2 
immunogen. The anti-hVR-1, anti-hVR-2, and anti-rVR-2 antibody titer in the 
immunized subject can be monitored over time by standard techniques, such as with an 
enzyme linked immunosorbent assay (ELISA) using immobilized hVR-1, hVR-2, and 
rVR-2. If desired, the antibody molecules directed against hVR-1, hVR-2, and rVR-2 
30 can be isolated from the mammal (e.g. , from the blood) and further purified by well 

known techniques, such as protein A chromatography to obtain the IgG fraction. At an 



BNSDOCID: <WO 0029577A1JA> 



WO 00/29577 PCT/US99/26701 

-39- 

appropriate time after immunization, e.g., when the anti-hVR-1, anti-hVR-2, and anti- 
rVR-2 antibody titers are highest, antibody-producing cells can be obtained from the 
subject and used to prepare monoclonal antibodies by standard techniques, such as the 
hybridoma technique originally described by Kohler and Milstein (1975) Nature 
5 256:495-497) (see also, Brown et al (1981) J. Immunol 127:539-46; Brown et al 
(1980) J. Biol. Chem .255:4980-83; Yeh et al (1976) Proc. Natl Acad. Scl USA 
76:2927-3 1 ; and Yeh et al. (1982) Int. J. Cancer 29:269-75), the more recent human B 
cell hybridoma technique (Kozbor et al (1983) Immunol Today 4:72), the EBV- 
hybridoma technique (Cole et al (1985), Monoclonal Antibodies and Cancer Therapy, 
1 0 Alan R. Liss, Inc., pp. 77-96) or trioma techniques. The technology for producing 
monoclonal antibody hybridomas is well known (see generally R. H. Kenneth, in 
Monoclonal Antibodies: A New Dimension In Biological Analyses, Plenum Publishing 
Corp., New York, New York (1980); E. A. Lerner (1981) Yale J. Biol. Med, 54:387402; 
M. L. Gefter et al. (1977) Somatic Cell Genet. 3:23136). Briefly, an immortal cell line 

15 (typically a myeloma) is fused to lymphocytes (typically splenocytes) from a mammal 
immunized with an hVR-1, hVR-2, and rVR-2 immunogen as described above, and the 
culture supernatants of the resulting hybridoma cells are screened to identify a 
hybridoma producing a monoclonal antibody that binds hVR-1, hVR-2. and rVR-2. 
Any of the many well known protocols used for fusing lymphocytes and 

20 immortalized cell lines can be applied for the purpose of generating an anti-hVR-1, anti- 
hVR-2, and anti-rVR-2 monoclonal antibodies (see, e.g., G. Galfre et al (1977) Nature 
266:55052; Gefter et al Somatic Cell Genet., cited supra; Lerner, Yale J. Biol Med., 
cited supra; Kenneth, Monoclonal Antibodies, cited supra). Moreover, the ordinarily 
skilled worker will appreciate that there are many variations of such methods which also 

25 would be useful. Typically, the immortal cell line (e.g., a myeloma cell line) is derived 
from the same mammalian species as the lymphocytes. For example, murine 
hybridomas can be made by fusing lymphocytes from a mouse immunized with an 
immunogenic preparation of the present invention with an immortalized mouse cell line, 
immortal cell lines are mouse myeloma cell lines that are sensitive to culture medium 

30 containing hypoxanthine, aminopterin and thymidine ("HAT medium"). Any of a 
number of myeloma cell lines can be used as a ftision partner according to standard 
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techniques, e.g., the P3-NSl/l-Ag4-l, P3-x63-Ag8.653 or Sp2/0-Agl4 myeloma lines. 
These myeloma lines are available from ATCC. Typically, HAT-sensitive mouse 
myeloma cells are fused to mouse splenocytes using polyethylene glycol ("PEG"). 
Hybridoma cells resulting from the fusion are then selected using HAT medium, which 
5 kills unfused and unproductively fused myeloma cells (unfused splenocytes die after 
several days because they are not transformed). Hybridoma cells producing a 
monoclonal antibody of the invention are detected by screening the hybridoma culture 
supernatants for antibodies that bind hVR-1, hVR-2, and rVR-2, e.g., using a standard 
ELISA assay. 

1 0 Alternative to preparing monoclonal antibody-secreting hybridomas, a 

monoclonal anti-hVR-1, anti-hVR-2, and anti-rVR-2 antibody can be identified and 
isolated by screening a recombinant combinatorial immunoglobulin library {e.g., an 
antibody phage display library) with hVR-1, hVR-2, and rVR-2 to thereby isolate 
immunoglobulin library members that bind hVR-1 , hVR-2, and rVR-2. Kits for 

15 generating and screening phage display libraries are commercially available {e.g., the 
Pharmacia Recombinant Phage Antibody System, Catalog No. 27-9400-01 ; and the 
Stratagene SurfZAP™ Phage Display Kit, Catalog No. 240612). Additionally, examples 
of methods and reagents particularly amenable for use in generating and screening 
antibody display library can be found in, for example, Ladner et ah U.S. Patent No. 

20 5,223,409; Kang et ah PCT International Publication No. WO 92/18619; Dower et ah 
PCT International Publication No. WO 91/17271; Winter et ah PCT International 
Publication WO 92/20791; Markland et ah PCT International Publication No. WO 
92/15679; Breitling et ah PCT International Publication WO 93/01288; McCafferty et 
ah PCT International Publication No. WO 92/01047; Garrard et ah PCT International 

25 Publication No. WO 92/09690; Ladner et ah PCT International Publication No. WO 
90/02809; Fuchs et ah (1991) Bio/Technology 9:1370-1372; Hay et ah (1992) Hum. 
Antibod. Hybridomas 3:81-85; Huse et ah (1989) Science 246:1275-1281; Griffiths et ah 
(1993) EMBO J 12:725-734; Hawkins et ah (1992)7. Moh Bioh 226:889-896; Clarkson 
et ah (1991) Nature 352:624-628; Gram et ah (1992) Proc. Nath Acad. ScL USA 

30 89:3576-3580; Garrad et ah (1991) Bio/Technology 9:1373-1377; Hoogenboom et ah 
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(1991) Nuc. Acid Res. 19:4133-4137; Barbas et al (1991) /W. Natl. Acad. Sci. USA 
88:7978-7982; and McCafferty et al. Nature (1990) 348:552-554. 

Additionally, recombinant anti-hVR-1, anti-hVR-2, and anti-rVR-2 antibodies, 
such as chimeric and humanized monoclonal antibodies, comprising both human and 
5 non-human portions, which can be made using standard recombinant DNA techniques, 
are within the scope of the invention. Such chimeric and humanized monoclonal 
antibodies can be produced by recombinant DNA techniques known in the art, for 
example using methods described in Robinson et al. International Application No. 
PCT/US86/02269; Akira, et al. European Patent Application 184,187; TaniguchL M., 

10 European Patent Application 171,496; Morrison et al. European Patent Application 
1 73,494; Neuberger et al. PCT International Publication No. WO 86/01533; Cabilly et 
al U.S. Patent No. 4,816,567; Cabilly et al European Patent Application 125,023; 
Better et al. (1988) Science 240:1041-1043; Liu et al. (1987) Proc. Natl. Acad Sci. USA 
84:3439-3443; Liu et al (1987) J. Immunol 139:3521-3526; Sun et al (1987) Proc. 

15 Natl Acad. Sci. USA 84:214-21 8; Nishimura et al (1987) Cane. Res. 47:999-1005; 
Wood etal. (1985) Nature 314:446-449; and Shaw et al (1988) J. Natl Cancer Inst. 
80:1553-1559); Morrison, S. L. (1985) Science 229:1202-1207; Oi et al. (1986) 
BioTechniques 4:214; Winter U.S. Patent 5,225,539; Jones et al (1986) Nature 
321:552-525; Verhoeyan et al (1988) Science 239:1534; and Beidler et al (1988) J. 

20 Immunol 141:4053-4060. 

An anti-hVR-1, anti-hVR-2, and anti-rVR-2 antibody (e.g., monoclonal 
antibody) can be used to isolate hVR-1, hVR-2, and rVR-2 by standard techniques, such 
as affinity chromatography or immunoprecipitation. An anti-hVR-1, anti-hVR-2, and 
anti-rVR-2 antibody can facilitate the purification of natural hVR-1, hVR-2, and rVR-2 

25 from cells and of recombinantly produced hVR-1, hVR-2, and rVR-2 expressed in host 
cells. Moreover, an anti-hVR-1, anti-hVR-2, and anti-rVR-2 antibody can be used to 
detect hVR-1, hVR-2, and rVR-2 protein (e.g., in a cellular lysate or cell supernatant) in 
order to evaluate the abundance and pattern of expression of the hVR-1, hVR-2, and 
rVR-2 protein. Anti-hVR-1, anti-hVR-2, and anti-rVR-2 antibodies can be used 

30 diagnostically to monitor protein levels in tissue as part of a clinical testing procedure, 
e.g., to, for example, determine the efficacy of a given treatment regimen. Detection can 
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be facilitated by coupling (i.e., physically linking) the antibody to a detectable 
substance. Examples of detectable substances include various enzymes, prosthetic 
groups, fluorescent materials, luminescent materials, bioluminescent materials, and 
radioactive materials. Examples of suitable enzymes include horseradish peroxidase, 
5 alkaline phosphatase, -galactosidase, or acetylcholinesterase; examples of suitable 
prosthetic group complexes include streptavidin/biotin and avidin/biotin; examples of 
suitable fluorescent materials include umbelliferone, fluorescein, fluorescein 
isothiocyanate, rhodamine, dichlorotriazinylamine fluorescein, dansyl chloride or 
phycoerythrin; an example of a luminescent material includes luminol; examples of 
1 0 bioluminescent materials include luciferase, luciferin, and aequorin, and examples of 
suitable radioactive material include I25 I, I3I I ? 35$ G r 3 H. 



III. Recombinant Expression Vectors and Host Cells 

Another aspect of the invention pertains to vectors, preferably expression 
1 5 vectors, containing a nucleic acid encoding an h VR- 1 , hVR-2, and rVR-2 protein (or a 
portion thereof). As used herein, the term "vector" refers to a nucleic acid molecule 
capable of transporting another nucleic acid to which it has been linked. One type of 
vector is a "plasmid", which refers to a circular double stranded DNA loop into which 
additional DNA segments can be ligated. Another type of vector is a viral vector, 
20 wherein additional DNA segments can be ligated into the viral genome. Certain vectors 
are capable of autonomous replication in a host cell into which they are introduced (e.g., 
bacterial vectors having a bacterial origin of replication and episomal mammalian 
vectors). Other vectors (e.g., non-episomal mammalian vectors) are integrated into the 
genome of a host cell upon introduction into the host cell, and thereby are replicated 
25 along with the host genome. Moreover, certain vectors are capable of directing the 
expression of genes to which they are operatively linked. Such vectors are referred to 
herein as "expression vectors". In general, expression vectors of utility in recombinant 
DNA techniques are often in the form of plasmids. In the present specification, 
"plasmid" and "vector" can be used interchangeably as the plasmid is the most 
30 commonly used form of vector. However, the invention is intended to include such 
other forms of expression vectors, such as viral vectors (e.g., replication defective 
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retroviruses, adenoviruses and adeno-associated viruses), which serve equivalent 
functions. 

The recombinant expression vectors of the invention comprise a nucleic acid of 
the invention in a form suitable for expression of the nucleic acid in a host cell, which 
5 means that the recombinant expression vectors include one or more regulatory 

sequences, selected on the basis of the host cells to be used for expression, which is 
operatively linked to the nucleic acid sequence to be expressed. Within a recombinant 
expression vector, "operably linked" is intended to mean that the nucleotide sequence of 
interest is linked to the regulatory sequence(s) in a manner which allows for expression 
10 of the nucleotide sequence (e.g., in an in vitro transcription/translation system or in a 
host cell when the vector is introduced into the host cell). The term "regulatory 
sequence" is intended to include promoters, enhancers and other expression control 
elements (e.g., polyadenylation signals). Such regulatory sequences are described, for 
example, in Goeddel; Gene Expression Technology; Methods in Enzymology 185, 

15 Academic Press, San Diego, CA (1990). Regulatory sequences include those which 
direct constitutive expression of a nucleotide sequence in many types of host cells and 
those which direct expression of the nucleotide sequence only in certain host cells (e.g, 
tissue-specific regulatory sequences). It will be appreciated by those skilled in the art 
that the design of the expression vector can depend on such factors as the choice of the 

20 host cell to be transformed, the level of expression of protein desired, and the like. The 
expression vectors of the invention can be introduced into host cells to thereby produce 
proteins or peptides, including fusion proteins or peptides, encoded by nucleic acids as 
described herein (e.g, hVR-1, hVR-2, and rVR-2 proteins, mutant forms of hVR-1, 
hVR-2, and rVR-2 proteins, fusion proteins, and the like). 

25 The recombinant expression vectors of the invention can be designed for 

expression of hVR-1, hVR-2, and rVR-2 proteins in prokaryotic or eukaryotic cells. For 
example, hVR-1, hVR-2, and rVR-2 proteins can be expressed in bacterial ceils such as 
£. coli, insect cells (using baculovirus expression vectors) yeast cells or mammalian 
cells. Suitable host cells are discussed further in Goeddel, Gene Expression Technology: 

30 Methods in Enzymology 185, Academic Press, San Diego, CA (1990). Alternatively, the 
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recombinant expression vector can be transcribed and translated in vitro, for example 
using T7 promoter regulatory sequences and T7 polymerase. 

Expression of proteins in prokaryotes is most often carried out in E. coli with 
vectors containing constitutive or inducible promoters directing the expression of either 
5 fusion or non-fusion proteins. Fusion vectors add a number of amino acids to a protein 
encoded therein, usually to the amino terminus of the recombinant protein. Such fusion 
vectors typically serve three purposes: 1) to increase expression of recombinant protein; 
2) to increase the solubility of the recombinant protein; and 3) to aid in the purification 
of the recombinant protein by acting as a ligand in affinity purification. Often, in fusion 
10 expression vectors, a proteolytic cleavage site is introduced at the junction of the fusion 
moiety and the recombinant protein to enable separation of the recombinant protein from 
the fusion moiety subsequent to purification of the fusion protein. Such enzymes, and 
their cognate recognition sequences, include Factor Xa, thrombin and enterokinase. 
Typical fusion expression vectors include pGEX (Pharmacia Biotech Inc; Smith, D.B. 
15 and Johnson, K.S. (1988) Gene 67:31-40), pMAL (New England Biolabs, Beverly, MA) 
and pRIT5 (Pharmacia, Piscataway, NJ) which fuse glutathione S-transferase (GST), 
maltose E binding protein, or protein A, respectively, to the target recombinant protein. 

Purified fusion proteins can be utilized in hVR-1, hVR-2, and rVR-2 activity 
assays, (e.g., direct assays or competitive assays described in detail below), or to, for 
20 example, generate antibodies specific for hVR-1 , hVR-2, and rVR-2 proteins. In a 
embodiment, an hVR-1, hVR-2, and rVR-2 fusion protein expressed in a retroviral 
expression vector of the present invention can be utilized to infect bone marrow cells 
which are subsequently transplanted into irradiated recipients. The pathology of the 
subject recipient is then examined after sufficient time has passed (e.g., six (6) weeks). 
25 Examples of suitable inducible non-fiision E. coli expression vectors include 

pTrc (Amann et al. 9 (1988) Gene 69:301-315) and pET 1 Id (Studier et al. 9 Gene 
Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, 
California (1990) 60-89). Target gene expression from the pTrc vector relies on host 
RNA polymerase transcription from a hybrid trp-lac fusion promoter. Target gene 
30 expression from the pET 1 Id vector relies on transcription from a T7 gnlO-lac fusion 
promoter mediated by a coexpressed viral RNA polymerase (T7 gnl). This viral 
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polymerase is supplied by host strains BL21(DE3) or HMS1 74(DE3) from a resident 
prophage harboring a T7 gnl gene under the transcriptional control of the lacUV 5 
promoter. 

One strategy to maximize recombinant protein expression in E. coli is to express 
5 the protein in a host bacteria with an impaired capacity to proteolytically cleave the 
recombinant protein (Gottesman, S., Gene Expression Technology: Methods in 
Enzymology 185, Academic Press, San Diego, California (1990) 1 19-128). Another 
strategy is to alter the nucleic acid sequence of the nucleic acid to be inserted into an 
expression vector so that the individual codons for each amino acid are those 
10 preferentially utilized in E. coli (Wada et ai, (1992) Nucleic Acids Res. 20:21 1 1-2118). 
Such alteration of nucleic acid sequences of the invention can be carried out by standard 
DNA synthesis techniques. 

In another embodiment, the hVR-1, hVR-2, and rVR-2 expression vector is a 
yeast expression vector. Examples of vectors for expression in yeast S. cerivisae include 
15 pYepSccl (Baldari, et aL, (1987) Embo J. 6:229-234), pMFa (Kurjan and Herskowitz, 

(1982) Cell 30:933-943), pJRY88 (Schultz et al. 9 (1987) Gene 54:1 13-123), pYES2 
(Invitrogen Corporation, San Diego, CA), and picZ (InVitrogen Corp, San Diego, CA). 

Alternatively, hVR-1, hVR-2, and rVR-2 proteins can be expressed in insect 
cells using baculovirus expression vectors. Baculovirus vectors available for expression 
20 of proteins in cultured insect cells (e.g., Sf 9 cells) include the pAc series (Smith et al 

(1983) Moi Cell Biol. 3:2156-2165) and the pVL series (Lucklow and Summers (1989) 
Virology 170:31-39). 

In yet another embodiment, a nucleic acid of the invention is expressed in 
mammalian cells using a mammalian expression vector. Examples of mammalian 

25 expression vectors include pCDM8 (Seed, B. (1987) Nature 329:840) and pMT2PC 
(Kaufman <?/ a/. (\9S7)EMBOJ. 6:187-195). When used in mammalian cells, the 
expression vector's control functions are often provided by viral regulatory elements. 
For example, commonly used promoters are derived from polyoma, Adenovirus 2, 
cytomegalovirus and Simian Virus 40. For other suitable expression systems for both 

30 prokaryotic and eukaryotic cells see chapters 16 and 1 7 of Sambrook, J., Fritsh, E. F., 
and Maniatis, T. Molecular Cloning: A Laboratory Manual, 2nd, ed t Cold Spring 
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Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 
1989. 

In another embodiment, the recombinant mammalian expression vector is 
capable of directing expression of the nucleic acid preferentially in a particular cell type 
5 (e.g., tissue-specific regulatory elements are used to express the nucleic acid). Tissue- 
specific regulatory elements are known in the art. Non-limiting examples of suitable 
tissue-specific promoters include the albumin promoter (liver-specific; Pinkert et al 
(1987) Genes Dev. 1 :268-277), lymphoid-specific promoters (Calame and Eaton (1988) 
Adv. Immunol. 43:235-275), in particular promoters of T cell receptors (Winoto and 
10 Baltimore (1989) EMBOJ. 8:729-733) and immunoglobulins (Banerji et al. (1983) Cell 
33:729-740; Queen and Baltimore (1983) Cell 33:741-748), neuron-specific promoters 
(e.g., the neurofilament promoter; Byrne and Ruddle (1989) Proc. Natl Acad. Sci. USA 
86:5473-5477), pancreas-specific promoters (Edlund et al. (1985) Science 230:912-916), 
and mammary gland-specific promoters (e.g., milk whey promoter; U.S. Patent No. 
1 5 4,873,3 1 6 and European Application Publication No. 264, 1 66). Developmentally- 
regulated promoters are also encompassed, for example the murine hox promoters 
(Kessel and Gruss (1990) Science 249:374-379) and the ct-fetoprotein promoter 
(Campes and Tilghman (1989) Genes Dev. 3:537-546). 

The expression characteristics of an endogenous hVR-1, hVR-2, and rVR-2 gene 
20 within a cell line or microorganism may be modified by inserting a heterologous DNA 
regulatory element into the genome of a stable cell line or cloned microorganism such 
that the inserted regulatory element is operatively linked with the endogenous hVR-1, 
hVR-2, and rVR-2 gene. For example, an endogenous hVR-1, hVR-2, and rVR-2 gene 
which is normally "trancriptionally silent", i.e., a hVR-1, hVR-2, and rVR-2 gene which 
25 is normally not expressed, or is expressed only at very low levels in a cell line or 

microorganism, may be activated by inserting a regulatory element which is capable of 
promoting the expression of a normally expressed gene product in that cell line or 
microorganism. Alternatively, a transcriptionally silent, endogenous hVR-1, hVR-2, 
and rVR-2 gene, may be activated by insertion of a promiscuous regulatory element that 
30 works across cell types. 
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A heterologous regulatory element may be inserted into a stable cell line or 
cloned microorganism, such that it is operatively linked with an endogenous hVR-1, 
hVR-2, and rVR-2 gene, using techniques, such as targeted homologous recombination, 
which are well known to those of skill in the art, and described e.g., in Chappel, U.S. 
5 Patent No.: 5,272,071; PCT publication No. WO 91/06667, published May 16, 1991. 

The invention further provides a recombinant expression vector comprising a 
DNA molecule of the invention cloned into the expression vector in an antisense 
orientation. That is, the DNA molecule is operatively linked to a regulatory sequence in 
a manner which allows for expression (by transcription of the DNA molecule) of an 

10 RNA molecule which is antisense to hVR-1, hVR-2, and rVR-2 mRNA. Regulatory 
sequences operatively linked to a nucleic acid cloned in the antisense orientation can be 
chosen which direct the continuous expression of the antisense RNA molecule in a 
variety of cell types, for instance viral promoters and/or enhancers, or regulatory 
sequences can be chosen which direct constitutive, tissue specific or cell type specific 

1 5 expression of antisense RNA. The antisense expression vector can be in the form of a 
recombinant plasmid, phagemid or attenuated virus in which antisense nucleic acids are 
produced under the control of a high efficiency regulatory region, the activity of which 
can be determined by the cell type into which the vector is introduced. For a discussion 
of the regulation of gene expression using antisense genes see Weintraub, H. et al, 

20 Antisense RNA as a molecular tool for genetic analysis, Reviews - Trends in Genetics, 
Vol. 1(1) 1986. 

Another aspect of the invention pertains to host cells into which an hVR-1, hVR- 
2, and rVR-2 nucleic acid molecule of the invention is introduced, e.g., an hVR-1, hVR- 
2, and rVR-2 nucleic acid molecule within a recombinant expression vector or an hVR- 

25 1, hVR-2, and rVR-2 nucleic acid molecule containing sequences which allow it to 

homologously recombine into a specific site of the host cell's genome. The terms "host 
cell" and "recombinant host cell" are used interchangeably herein. It is understood that 
such terms refer not only to the particular subject cell but to the progeny or potential 
progeny of such a cell. Because certain modifications may occur in succeeding 

30 generations due to either mutation or environmental influences, such progeny may not, 



BNSDOCtD: <WO 0029577A1JA> 



WO 00/29577 PCT/US99/26701 

-48 - 

in fact, be identical to the parent cell, but are still included within the scope of the term 
as used herein. 

A host cell can be any prokaryotic or eukaryotic cell. For example, an hVR-1 , 
hVR-2, and rVR-2 protein can be expressed in bacterial cells such as E. coli, insect cells, 
5 yeast or mammalian cells (such as Chinese hamster ovary cells (CHO) or COS cells). 
Other suitable host cells are known to those skilled in the art. 

Vector DNA can be introduced into prokaryotic or eukaryotic cells via 
conventional transformation or transfection techniques. As used herein, the terms 
"transformation" and "transfection" are intended to refer to a variety of art-recognized 
10 techniques for introducing foreign nucleic acid {e.g., DNA) into a host cell, including 
calcium phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated 
transfection, lipofection, or electroporation. Suitable methods for transforming or 
iransfecting host cells can be found in Sambrook, et al {Molecular Cloning: A 
Laboratory Manual 2nd, ed, Cold Spring Harbor Laboratory, Cold Spring Harbor 
1 5 Laboratory Press, Cold Spring Harbor, NY, 1 989), and other laboratory manuals. 

For stable transfection of mammalian cells, it is known that, depending upon the 
expression vector and transfection technique used, only a small fraction of cells may 
integrate the foreign DNA into their genome. In order to identify and select these 
integrants, a gene that encodes a selectable marker {e.g., resistance to antibiotics) is 
20 generally introduced into the host cells along with the gene of interest, selectable 

markers include those which confer resistance to drugs, such as G418, hygromycin and 
methotrexate. Nucleic acid encoding a selectable marker can be introduced into a host 
cell on the same vector as that encoding an hVR-1, hVR-2, and rVR-2 protein or can be 
introduced on a separate vector. Cells stably transfected with the introduced nucleic 
25 acid can be identified by drug selection {e.g y cells that have incorporated the selectable 
marker gene will survive, while the other cells die). 

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in 
culture, can be used to produce (i.e., express) an hVR-1, hVR-2, and rVR-2 protein. 
Accordingly, the invention further provides methods for producing an hVR-1, hVR-2, 
30 and rVR-2 protein using the host cells of the invention. In one embodiment, the method 
comprises culturing the host cell of invention (into which a recombinant expression 
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vector encoding an hVR-1, hVR-2, and rVR-2 protein has been introduced) in a suitable 
medium such that an hVR-1, hVR-2, and rVR-2 protein is produced. In another 
embodiment, the method further comprises isolating an hVR-1, hVR-2, and rVR-2 
protein from the medium or the host cell. 
5 The host cells of the invention can also be used to produce non-human transgenic 

animals. For example, in one embodiment, a host cell of the invention is a fertilized 
oocyte or an embryonic stem cell into which hVR-1, hVR-2, and rVR-2-coding 
sequences have been introduced. Such host cells can then be used to create non-human 
transgenic animals in which exogenous hVR-1, hVR-2, and rVR-2 sequences have been 

10 introduced into their genome or homologous recombinant animals in which endogenous 
hVR-1, hVR-2, and rVR-2 sequences have been altered. Such animals are useful for 
studying the function and/or activity of an hVR-1, hVR-2, and rVR-2 and for identifying 
and/or evaluating modulators of hVR-1, hVR-2, and rVR-2 activity. As used herein, a 
"transgenic animal" is a non-human animal, preferably a mammal, more preferably a 

15 rodent such as a rat or mouse, in which one or more of the cells of the animal includes a 
transgene. Other examples of transgenic animals include non-human primates, sheep, 
dogs, cows, goats, chickens, amphibians, and the like. A transgene is exogenous DNA 
which is integrated into the genome of a cell from which a transgenic animal develops 
and which remains in the genome of the mature animal, thereby directing the expression 

20 of an encoded gene product in one or more cell types or tissues of the transgenic animal. 
As used herein, a "homologous recombinant animal" is a non-human animal, preferably 
a mammal, more preferably a mouse, in which an endogenous hVR-1, hVR-2, and rVR- 
2 gene has been altered by homologous recombination between the endogenous gene 
and an exogenous DNA molecule introduced into a cell of the animal, e.g., an 

25 embryonic cell of the animal, prior to development of the animal. 

A transgenic animal of the invention can be created by introducing an hVR-1, 
hVR-2, and rVR-2-encoding nucleic acid into the male pronuclei of a fertilized oocyte, 
e.g., by microinjection, retroviral infection, and allowing the oocyte to develop in a 
pseudopregnant female foster animal. The hVR-1, hVR-2, and rVR-2 cDNA sequence 

30 of SEQ ID NO:l, 3, 5, 7 or 9 can be introduced as a transgene into the genome of a non- 
human animal. Alternatively, a nonhuman homologue of a hVR-2 gene, such as a 
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mouse or rat hVR-2, e.g., the rVR-2 gene, can be used as a transgene. Alternatively, an 
hVR-1, hVR-2, and rVR-2 gene homologue, such as another member of the 
Capsaicin/Vanilloid family, can be isolated based on hybridization to the hVR-1, hVR-2, 
and rVR-2 cDNA sequences of SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12, (described further 
5 in subsection I above) and used as a transgene. Intronic sequences and polyadenylation 
signals can also be included in the transgene to increase the efficiency of expression of 
the transgene. A tissue-specific regulatory sequence(s) can be operably linked to an 
hVR-1, hVR-2, and rVR-2 transgene to direct expression of an hVR-K hVR-2, and rVR- 
2 protein to particular cells. Methods for generating transgenic animals via embryo 
10 manipulation and microinjection, particularly animals such as mice, have become 

conventional in the art and are described, for example, in U.S. Patent Nos. 4,736,866 and 
4 ? 870,009, both by Leder et al, U.S. Patent No. 4,873,191 by Wagner et al and in 
Hogan, B., Manipulating the Mouse Embryo, (Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, N.Y., 1986). Similar methods are used for production of other 
1 5 transgenic animals. A transgenic founder animal can be identified based upon the 

presence of an hVR-1, hVR-2, and rVR-2 transgene in its genome and/or expression of 
hVR-1, hVR-2, and rVR-2 mRNA in tissues or cells of the animals. A transgenic 
founder animal can then be used to breed additional animals carrying the transgene. 
Moreover, transgenic animals carrying a transgene encoding an hVR-1, hVR-2, and 
20 rVR-2 protein can further be bred to other transgenic animals carrying other transgenes. 

To create a homologous recombinant animal, a vector is prepared which contains 
at least a portion of an hVR-1, hVR-2, and rVR-2 gene into which a deletion, addition or 
substitution has been introduced to thereby alter, e.g., functionally disrupt, the hVR-1, 
hVR-2, and rVR-2 gene. The VR-1 or VR-2 gene can be a human gene (e.g., the cDNA 
25 of SEQ ID NO: 1,3,5, 4, 6, 7, or 9), but more preferably, is a non-human homologue of 
a hVR-1 and hVR-2 gene (e.g., the cDNA of SEQ ID NO: 10 or 12, or a cDNA isolated 
by stringent hybridization with the nucleotide sequence of SEQ ID NO: 1, 3, 5, 4, 6, 7, 
or 9). For example, a mouse VR-2 gene can be used to construct a homologous 
recombination nucleic acid molecule, e.g., a vector, suitable for altering an endogenous 
30 VR-2 gene in the mouse genome. In a embodiment, the homologous recombination 
nucleic acid molecule is designed such that, upon homologous recombination, the 
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endogenous hVR-1, hVR-2, and rVR-2 gene is functionally disrupted (i.e., no longer 
encodes a functional protein; also referred to as a "knock out" vector). Alternatively, the 
homologous recombination nucleic acid molecule can be designed such that, upon 
homologous recombination, the endogenous hVR-1, hVR-2, and rVR-2 gene is mutated 
5 or otherwise altered but still encodes functional protein (e.g., the upstream regulatory 
region can be altered to thereby alter the expression of the endogenous hVR-1, hVR-2, 
and rVR-2 protein). In the homologous recombination nucleic acid molecule, the altered 
portion of the hVR-1, hVR-2, and rVR-2 gene is flanked at its 5' and 3' ends by 
additional nucleic acid sequence of the hVR-1 , hVR-2, and rVR-2 gene to allow for 
10 homologous recombination to occur between the exogenous hVR-1, hVR-2, and rVR-2 
gene carried by the homologous recombination nucleic acid molecule and an 
endogenous hVR-1, hVR-2, and rVR-2 gene in a cell, e.g., an embryonic stem cell. The 
additional flanking hVR-1, hVR-2, and rVR-2 nucleic acid sequence is of sufficient 
length for successful homologous recombination with the endogenous gene. Typically, 
15 several kilobases of flanking DNA (both at the 5' and 3' ends) are included in the 
homologous recombination nucleic acid molecule (see, e.g., Thomas, K.R. and 
Capecchi, M. R. (1987) Cell 51:503 for a description of homologous recombination 
vectors). The homologous recombination nucleic acid molecule is introduced into a cell, 
e.g., an embryonic stem cell line (e.g., by electroporation) and cells in which the 
20 introduced hVR-1, hVR-2, and rVR-2 gene has homologously recombined with the 
endogenous hVR-1, hVR-2, and rVR-2 gene are selected (see e.g., Li, E. et al (1992) 
Cell 69:915). The selected cells can then injected into a blastocyst of an animal (e.g., a 
mouse) to form aggregation chimeras (see e.g., Bradley, A. in Teratocarcinomas and 
Embryonic Stem Cells: A Practical Approach, E.J. Robertson, ed. (IRL, Oxford, 1987) 
25 pp. 1 13-152). A chimeric embryo can then be implanted into a suitable pseudopregnant 
female foster animal and the embryo brought to term. Progeny harboring the 
homologously recombined DNA in their germ cells can be used to breed animals in 
which all cells of the animal contain the homologously recombined DNA by germline 
transmission of the transgene. Methods for constructing homologous recombination 
30 nucleic acid molecules, e.g., vectors, or homologous recombinant animals are described 
further in Bradley, A. (1991) Current Opinion in Biotechnology 2:823-829 and in PCT 
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International Publication Nos.: WO 90/1 1354 by Le Mouellec et al ; WO 91/01 140 by 
Smithies et al; WO 92/0968 by Zijlstra et al; and WO 93/04169 by Berns et al 

In another embodiment, transgenic non-humans animals can be produced which 
contain selected systems which allow for regulated expression of the transgene. One 
5 example of such a system is the cre/loxP recombinase system of bacteriophage PI . For 
a description of the cre/loxP recombinase system, see, e.g., Lakso et al (1992) Proc. 
Natl Acad ScL USA 89:6232-6236. Another example of a recombinase system is the 
FLP recombinase system of Saccharomyces cerevisiae (O'Gorman et al (1991) Science 
251:1351-1355. If a cre/loxP recombinase system is used to regulate expression of the 
10 transgene, animals containing transgenes encoding both the Cre recombinase and a 

selected protein are required. Such animals can be provided through the construction of 
"double'* transgenic animals, e.g., by mating two transgenic animals, one containing a 
transgene encoding a selected protein and the other containing a transgene encoding a 
recombinase. 

1 5 Clones of the non-human transgenic animals described herein can also be 

produced according to the methods described in Wilmut, I. et al (1997) Nature 385:810- 
813 and PCT International Publication Nos. WO 97/07668 and WO 97/07669. In brief, 
a cell, e.g., a somatic cell, from the transgenic animal can be isolated and induced to exit 
the growth cycle and enter G 0 phase. The quiescent cell can then be fused, e.g., through 

20 the use of electrical pulses, to an enucleated oocyte from an animal of the same species 
from which the quiescent cell is isolated. The recontructed oocyte is then cultured such 
that it develops to morula or blastocyte and then transferred to pseudopregnant female 
foster animal. The offspring borne of this female foster animal will be a clone of the 
animal from which the cell, e.g., the somatic cell, is isolated. 

25 

IV. Pharmaceutical Compositions 

The hVR-1, hVR-2, and rVR-2 nucleic acid molecules, fragments of hVR-1, 
hVR-2, and rVR-2 proteins, and anti-hVR-1, anti-hVR-2, and anti-rVR-2 antibodies 
(also referred to herein as "active compounds") of the invention can be incorporated into 
30 pharmaceutical compositions suitable for administration. Such compositions typically 
comprise the nucleic acid molecule, protein, or antibody and a pharmaceutical ly 
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acceptable carrier. As used herein the language "pharmaceutically acceptable carrier" is 
intended to include any and all solvents, dispersion media, coatings, antibacterial and 
antifungal agents, isotonic and absorption delaying agents, and the like, compatible with 
pharmaceutical administration. The use of such media and agents for pharmaceutically 
5 active substances is well known in the art. Except insofar as any conventional media or 
agent is incompatible with the active compound, use thereof in the compositions is 
contemplated. Supplementary active compounds can also be incorporated into the 
compositions. 

A pharmaceutical composition of the invention is formulated to be compatible 
10 with its intended route of administration. Examples of routes of administration include 
parenteral, e.g., intravenous, intradermal, subcutaneous, oral (e.g., inhalation), 
transdermal (topical), transmucosal, and rectal administration. Solutions or suspensions 
used for parenteral, intradermal, or subcutaneous application can include the following 
components: a sterile diluent such as water for injection, saline solution, fixed oils, 
15 polyethylene glycols, glycerine, propylene glycol or other synthetic solvents; 

antibacterial agents such as benzyl alcohol or methyl parabens; antioxidants such as 
ascorbic acid or sodium bisulfite; chelating agents such as ethylenediaminetetraacetic 
acid; buffers such as acetates, citrates or phosphates and agents for the adjustment of 
tonicity such as sodium chloride or dextrose. pH can be adjusted with acids or bases, 
20 such as hydrochloric acid or sodium hydroxide. The parenteral preparation can be 
enclosed in ampoules, disposable syringes or multiple dose vials made of glass or 
plastic. 

Pharmaceutical compositions suitable for injectable use include sterile aqueous 
solutions (where water soluble) or dispersions and sterile powders for the 

25 extemporaneous preparation of sterile injectable solutions or dispersion. For 

intravenous administration, suitable carriers include physiological saline, bacteriostatic 
water, Cremophor EL™ (BASF, Parsippany, NJ) or phosphate buffered saline (PBS). In 
all cases, the composition must be sterile and should be fluid to the extent that easy 
syringability exists. It must be stable under the conditions of manufacture and storage 

30 and must be preserved against the contaminating action of microorganisms such as 
bacteria and fungi. The carrier can be a solvent or dispersion medium containing, for 



BNSDOCID: <WO 0029577A1 IA> 



WO 00/29577 PCT/US99/26701 

-54- 

example, water, ethanol, polyol (for example, glycerol, propylene glycol, and liquid 
polyetheylene glycol, and the like), and suitable mixtures thereof. The proper fluidity 
can be maintained, for example, by the use of a coating such as lecithin, by the 
maintenance of the required particle size in the case of dispersion and by the use of 
5 surfactants. Prevention of the action of microorganisms can be achieved by various 
antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol, 
ascorbic acid, thimerosal, and the like. In many cases, it will be preferable to include 
isotonic agents, for example, sugars, poly alcohols such as manitol, sorbitol, sodium 
chloride in the composition. Prolonged absorption of the injectable compositions can be 
1 0 brought about by including in the composition an agent which delays absorption, for 
example, aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by incorporating the active 
compound (e.g., a fragment of an hVR-1, hVR-2, and rVR-2 protein or an anti-hVR-1, 
anti-hVR-2, and anti-rVR-2 antibody) in the required amount in an appropriate solvent 
15 with one or a combination of ingredients enumerated above, as required, followed by 
filtered sterilization. Generally, dispersions are prepared by incorporating the active 
compound into a sterile vehicle which contains a basic dispersion medium and the 
required other ingredients from those enumerated above. In the case of sterile powders 
for the preparation of sterile injectable solutions, the methods of preparation are vacuum 
20 drying and freeze-drying which yields a powder of the active ingredient plus any 
additional desired ingredient from a previously sterile-filtered solution thereof. 

Oral compositions generally include an inert diluent or an edible carrier. They 
can be enclosed in gelatin capsules or compressed into tablets. For the purpose of oral 
therapeutic administration, the active compound can be incorporated with excipients and 
25 used in the form of tablets, troches, or capsules, oral compositions can also be prepared 
using a fluid carrier for use as a mouthwash, wherein the compound in the fluid carrier is 
applied orally and swished and expectorated or swallowed. Pharmaceutically 
compatible binding agents, and/or adjuvant materials can be included as part of the 
composition. The tablets, pills, capsules, troches and the like can contain any of the 
30 following ingredients, or compounds of a similar nature: a binder such as 

microcrystalline cellulose, gum tragacanth or gelatin; an excipient such as starch or 
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lactose, a disintegrating agent such as alginic acid, Primogel, or corn starch; a lubricant 
such as magnesium stearate or Sterotes; a glidant such as colloidal silicon dioxide; a 
sweetening agent such as sucrose or saccharin; or a flavoring agent such as peppermint, 
methyl salicylate, or orange flavoring. 
5 For administration by inhalation, the compounds are delivered in the form of an 

aerosol spray from pressured container or dispenser which contains a suitable propellant, 
e.g., a gas such as carbon dioxide, or a nebulizer. 

Systemic administration can also be by transmucosal or transdermal means. For 
transmucosal or transdermal administration, penetrants appropriate to the barrier to be 

10 permeated are used in the formulation. Such penetrants are generally known in the art, 
and include, for example, for transmucosal administration, detergents, bile salts, and 
fusidic acid derivatives. Transmucosal administration can be accomplished through the 
use of nasal sprays or suppositories. For transdermal administration, the active 
compounds are formulated into ointments, salves, gels, or creams as generally known in 

15 the art. 

The compounds can also be prepared in the form of suppositories (e.g., with 
conventional suppository bases such as cocoa butter and other glycerides) or retention 
enemas for rectal delivery. 

In one embodiment, the active compounds are prepared with carriers that will 

20 protect the compound against rapid elimination from the body, such as a controlled 
release formulation, including implants and microencapsulated delivery systems. 
Biodegradable, biocompatible polymers can be used, such as ethylene vinyl acetate, 
polyanhydrides, polyglycolic acid, collagen, polyorthoesters, and poly lactic acid. 
Methods for preparation of such formulations will be apparent to those skilled in the art. 

25 The materials can also be obtained commercially from Alza Corporation and Nova 

Pharmaceuticals, Inc. Liposomal suspensions (including liposomes targeted to infected 
cells with monoclonal antibodies to viral antigens) can also be used as pharmaceutical^ 
acceptable carriers. These can be prepared according to methods known to those skilled 
in the art, for example, as described in U.S. Patent No. 4,522,81 1. 
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It is especially advantageous to formulate oral or parenteral compositions in 
dosage unit form for ease of administration and uniformity of dosage. Dosage unit form 
as used herein refers to physically discrete units suited as unitary dosages for the subject 
to be treated; each unit containing a predetermined quantity of active compound 
5 calculated to produce the desired therapeutic effect in association with the required 
pharmaceutical carrier. The specification for the dosage unit forms of the invention are 
dictated by and directly dependent on the unique characteristics of the active compound 
and the particular therapeutic effect to be achieved, and the limitations inherent in the art 
of compounding such an active compound for the treatment of individuals. 
10 Toxicity and therapeutic efficacy of such compounds can be determined by 

standard pharmaceutical procedures in cell cultures or experimental animals, e.g., for 
determining the LD50 (the dose lethal to 50% of the population) and the ED50 (the dose 
therapeutically effective in 50% of the population). The dose ratio between toxic and 
therapeutic effects is the therapeutic index and it can be expressed as the ratio 
15 LD50/ED50. Compounds which exhibit large therapeutic indices are preferred. While 
compounds that exhibit toxic side effects may be used, care should be taken to design a 
delivery system that targets such compounds to the site of affected tissue in order to 
minimize potential damage to uninfected cells and, thereby, reduce side effects. 

The data obtained from the cell culture assays and animal studies can be used in 
20 formulating a range of dosage for use in humans. The dosage of such compounds lies 
preferably within a range of circulating concentrations that include the ED50 with little 
or no toxicity. The dosage may vary within this range depending upon the dosage form 
employed and the route of administration utilized. For any compound used in the 
method of the invention, the therapeutically effective dose can be estimated initially 
25 from cell culture assays. A dose may be formulated in animal models to achieve a 

circulating plasma concentration range that includes the IC50 (i.e., the concentration of 
the test compound which achieves a half-maximal inhibition of symptoms) as 
determined in cell culture. Such information can be used to more accurately determine 
useful doses in humans. Levels in plasma may be measured, for example, by high 
30 performance liquid chromatography. 
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As defined herein, a therapeutically effective amount of protein or polypeptide 
(/.<?., an effective dosage) ranges from about 0.001 to 30 mg/kg body weight, preferably 
about 0.01 to 25 mg/kg body weight, more preferably about 0.1 to 20 mg/kg body 
weight, and even more preferably about 1 to 1 0 mg/kg, 2 to 9 mg/kg, 3 to 8 mg/kg, 4 to 
5 7 mg/kg, or 5 to 6 mg/kg body weight. The skilled artisan will appreciate that certain 
factors may influence the dosage required to effectively treat a subject, including but not 
limited to the severity of the disease or disorder, previous treatments, the general health 
and/or age of the subject, and other diseases present. Moreover, treatment of a subject 
with a therapeutically effective amount of a protein, polypeptide, or antibody can 

10 include a single treatment or, preferably, can include a series of treatments. In a 

preferred example, a subject is treated with antibody, protein, or polypeptide in the range 
of between about 0.1 to 20 mg/kg body weight, one time per week for between about 1 
to 1 0 weeks, preferably between 2 to 8 weeks, more preferably between about 3 to 7 
weeks, and even more preferably for about 4, 5, or 6 weeks. It will also be appreciated 

1 5 that the effective dosage of antibody, protein, or polypeptide used for treatment may 
increase or decrease over the course of a particular treatment. Changes in dosage may 
result and become apparent from the results of diagnostic assays as described herein. 

The present invention encompasses agents which modulate expression or 
activity. An agent may, for example, be a small molecule. For example, such small 

20 molecules include, but are not limited to, peptides, peptidomimetics, amino acids, amino 
acid analogs, polynucleotides, polynucleotide analogs, nucleotides, nucleotide analogs, 
organic or inorganic compounds (i.e,. including heteroorganic and organometallic 
compounds) having a molecular weight less than about 10,000 grams per mole, organic 
or inorganic compounds having a molecular weight less than about 5,000 grams per 

25 mole, organic or inorganic compounds having a molecular weight less than about 1 ,000 
grams per mole, organic or inorganic compounds having a molecular weight less than 
about 500 grams per mole, and salts, esters, and other pharmaceutically acceptable forms 
of such compounds. 

It is understood that appropriate doses of small molecule agents depends upon a 
30 number of factors within the ken of the ordinarily skilled physician, veterinarian, or 
researcher. The dose(s) of the small molecule will vary, for example, depending upon 
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the identity, size, and condition of the subject or sample being treated, further depending 
upon the route by which the composition is to be administered, if applicable, and the 
effect which the practitioner desires the small molecule to have upon the nucleic acid or 
polypeptide of the invention. 
5 Exemplary doses include milligram or microgram amounts of the small molecule 

per kilogram of subject or sample weight {e.g., about 1 microgram per kilogram to about 
500 milligrams per kilogram, about 100 micrograms per kilogram to about 5 milligrams 
per kilogram, or about 1 microgram per kilogram to about 50 micrograms per kilogram. 
It is furthermore understood that appropriate doses of a small molecule depend upon the 

1 0 potency of the small molecule with respect to the expression or activity to be modulated. 
Such appropriate doses may be determined using the assays described herein. 

When one or more of these small molecules is to be administered to an animal 
{e.g., a human) in order to modulate expression or activity of a polypeptide or nucleic 
acid of the invention, a physician, veterinarian, or researcher may, for example, 

1 5 prescribe a relatively low dose at first, subsequently increasing the dose until an 

appropriate response is obtained. In addition, it is understood that the specific dose level 
for any particular animal subject will depend upon a variety of factors including the 
activity of the specific compound employed, the age, body weight, general health, 
gender, and diet of the subject, the time of administration, the route of administration, 

20 the rate of excretion, any drug combination, and the degree of expression or activity to 
be modulated. 

The nucleic acid molecules of the invention can be inserted into vectors and used 
as gene therapy vectors. Gene therapy vectors can be delivered to a subject by, for 
example, intravenous injection, local administration (see U.S. Patent 5,328,470) or by 

25 stereotactic injection (see e.g., Chen et al (1994) Proc. Natl Acad. ScL USA 91:3054- 
3057). The pharmaceutical preparation of the gene therapy vector can include the gene 
therapy vector in an acceptable diluent, or can comprise a slow release matrix in which 
the gene delivery vehicle is imbedded. Alternatively, where the complete gene delivery 
vector can be produced intact from recombinant cells, e.g., retroviral vectors, the 

30 pharmaceutical preparation can include one or more cells which produce the gene 
delivery system. 
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The pharmaceutical compositions can be included in a container, pack, or 
dispenser together with instructions for administration. 

V. Uses and Methods of the Invention 
5 The nucleic acid molecules, proteins, protein homologues, and antibodies 

described herein can be used in one or more of the following methods: a) screening 
assays; b) predictive medicine (e.g., diagnostic assays, prognostic assays, monitoring 
clinical trials, and pharmacogenetics); and c) methods of treatment (e.g., therapeutic and 
prophylactic). As described herein, an hVR-1, hVR-2, and rVR-2 protein of the 
1 0 invention has one or more of the following activities: (1) it interacts with a non-hVR-1 , 
non-hVR-2, and non-rVR-2 protein molecule, e.g., a vanilloid compound such as 
capsaicin; (2) it modulates intracellular calcium concentration; (3) it activates an hVR- 
1 , hVR-2, and rVR-2-dependent signal transduction pathway; and (4) it modulates a pain 
signaling mechanism, and, thus, can be used to, for example, (1) modulate the 

15 interaction with a non-hVR-1, non-hVR-2, and non-rVR-2 protein molecule; (2) 

modulate intracellular calcium concentration; (3) activate an hVR-1, hVR-2, and rVR- 
2 -dependent signal transduction pathway; and (4) modulate a pain signaling mechanism. 

The isolated nucleic acid molecules of the invention can be used, for example, to 
express hVR-1, hVR-2, and rVR-2 protein (e.g., via a recombinant expression vector in 

20 a host cell in gene therapy applications), to detect hVR-1, hVR-2, and rVR-2 mRNA 
(e.g., in a biological sample) or a genetic alteration in an hVR-1, hVR-2, and rVR-2 
gene, and to modulate hVR-1, hVR-2, and rVR-2 activity, as described further below. 
The hVR-1, hVR-2, and rVR-2 proteins can be used to screen for naturally occurring 
hVR-1, hVR-2, and rVR-2 substrates, to screen for drugs or compounds which modulate 

25 hVR-1 , hVR-2, and rVR-2 activity, as well as to treat disorders characterized by 

insufficient or excessive production of hVR-1, hVR-2, and rVR-2 protein or production 
of hVR-1, hVR-2, and rVR-2 protein forms which have decreased or aberrant activity 
compared to hVR-1, hVR-2, and rVR-2 wild type protein (e.g., pain disorders). 
Moreover, the anti-hVR-1, anti-hVR-2, and anti-rVR-2 antibodies of the invention can 

30 be used to detect and isolate hVR-1, hVR-2, and rVR-2 proteins, regulate the 
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bioavailability of hVR-1, hVR-2, and rVR-2 proteins, and modulate hVR-1, hVR-2, and 
rVR-2 activity. 

A. Screening Assays : 
5 The invention provides a method (also referred to herein as a "screening assay") 

for identifying modulators, i.e., candidate or test compounds or agents (e.g., peptides, 
peptidomimetics, small molecules or other drugs) which bind to hVR-1, hVR-2, and 
rVR-2 proteins, have a stimulatory or inhibitory effect on, for example, hVR-1, hVR-2, 
and rVR-2 expression or hVR-1, hVR-2, and rVR-2 activity, or have a stimulatory or 
1 0 inhibitory effect on, for example, the expression or activity of hVR-1 , hVR-2, and rVR- 
2 substrate. 

In one embodiment, the invention provides assays for screening candidate or test 
compounds which are substrates of an hVR-1, hVR-2, and rVR-2 protein or polypeptide 
or biologically active portion thereof. In another embodiment, the invention provides 
1 5 assays for screening candidate or test compounds which bind to or modulate the activity 
of an hVR-1, hVR-2, and rVR-2 protein or polypeptide or biologically active portion 
thereof. The test compounds of the present invention can be obtained using any of the 
numerous approaches in combinatorial library methods known in the art, including: 
biological libraries; spatially addressable parallel solid phase or solution phase libraries; 

20 synthetic library methods requiring deconvolution; the 'one-bead one-compound' library 
method; and synthetic library methods using affinity chromatography selection. The 
biological library approach is limited to peptide libraries, while the other four 
approaches are applicable to peptide, non-peptide oligomer or small molecule libraries 
of compounds (Lam, K.S. ( 1 997) Anticancer Drug Des. 1 2: 145). 

25 Examples of methods for the synthesis of molecular libraries can be found in the 

art, for example in: DeWitt et al (1993) Proc. Natl. Acad, Set U.S.A. 90:6909; Erb et 
al (1994) Proc. Natl. Acad. ScL USA 91 :1 1422; Zuckermann et al (1994). J. Med. 
Chem. 37:2678; Cho et al (1993) Science 261 :1303; Carrell et al. (1994) Angew. Chem. 
Int. Ed. Engl 33:2059; Carell et al. (1994) Angew. Chem. Int. Ed. Engl 33:2061; and in 

30 Gallop et al (1994) J. Med. Chem. 37:1233. 
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Libraries of compounds may be presented in solution (e.g., Houghten (1992) 
Biotechniques 13:412-421), or on beads (Lam (1991) Nature 354:82-84), chips (Fodor 
(1993) Nature 364:555-556), bacteria (Ladner USP 5,223,409), spores (Ladner USP 
'409), plasmids (Cull et al (1992) Proc Natl Acad Sci USA 89:1865-1869) or on phage 
5 (Scott and Smith (1990) Science 249:386-390); (Devlin (1 990) Science 249:404-406); 
(Cwirla et al (1990) Proc. Natl Acad ScL 87:6378-6382); (Felici (1991) J. Mol Biol 
222:301-310); (Ladner supra.). 

In one embodiment, an assay is a cell-based assay in which a cell, e.g., a 
neuronal cell, which expresses an hVR-1, hVR-2, and rVR-2 protein or biologically 
1 0 active portion thereof is contacted with a test compound and the ability of the test 

compound to modulate hVR-1, hVR-2, and rVR-2 activity is determined. Determining 
the ability of the test compound to modulate hVR-1, hVR-2, and rVR-2 activity can be 
accomplished by monitoring, for example, intracellular calcium concentration or 
membrane depolarization by, e.g., patch-clamp recordings in whole-cell, inside-out, and 
15 outside-out configurations (as described in, for example, Tominaga M. et al (1998) 

Neuron 21 :53 1-543). Determining the ability of the test compound to modulate hVR-1, 
hVR-2, and rVR-2 activity can further be accomplished by monitoring the activity of an 
hVR-1, hVR-2, and rVR-2-regulated transcription factor. The cell, for example, can be 
of mammalian origin, e.g., a neuronal cell. 
20 The ability of the test compound to modulate hVR-1 , hVR-2, and rVR-2 binding 

to a substrate or to bind to hVR-1, hVR-2, and rVR-2 can also be determined. 
Determining the ability of the test compound to modulate hVR-1, hVR-2, and rVR-2 
binding to a substrate can be accomplished, for example, by coupling the hVR-1, hVR- 
2, and rVR-2 substrate with a radioisotope or enzymatic label such that binding of the 
25 hVR-1, hVR-2, and rVR-2 substrate to hVR-1, hVR-2, and rVR-2 can be determined by 
detecting the labeled hVR-1, hVR-2, and rVR-2 substrate in a complex. Determining 
the ability of the test compound to bind hVR-1 , hVR-2, and rVR-2 can be accomplished, 
for example, by coupling the compound with a radioisotope or enzymatic label such that 
binding of the compound to hVR-1, hVR-2, and rVR-2 can be determined by detecting 
30 the labeled hVR-1, hVR-2, and rVR-2 compound in a complex. For example, 

compounds {e.g., hVR-1, hVR-2, and rVR-2 substrates) can be labeled with 125 I, 35^ 
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14 C, or 3 H, either directly or indirectly, and the radioisotope detected by direct counting 
of radioemmission or by scintillation counting. Alternatively, compounds can be 
enzymatically labeled with, for example, horseradish peroxidase, alkaline phosphatase, 
or luciferase, and the enzymatic label detected by determination of conversion of an 
5 appropriate substrate to product. 

It is also within the scope of this invention to determine the ability of a 
compound (e.g., an hVR-1, hVR-2, and rVR-2 substrate) to interact with hVR-1, hVR-2, 
and rVR-2 without the labeling of any of the interactants. For example, a 
microphysiometer can be used to detect the interaction of a compound with hVR-1, 
1 0 hVR-2, and rVR-2 without the labeling of either the compound or the hVR-1 , hVR-2, 
andrVR-2. McConnell, H. ML et al. (1992) Science 257:1906-1912. As used herein, a 
"microphysiometer" (e.g., Cytosensor) is an analytical instrument that measures the rate 
at which a cell acidifies its environment using a light-addressable potentiometric sensor 
(LAPS). Changes in this acidification rate can be used as an indicator of the interaction 
1 5 between a compound and hVR-1 , hVR-2, and rVR-2. 

In yet another embodiment, an assay of the present invention is a cell-free assay 
in which an hVR-1, hVR-2, and rVR-2 protein or biologically active portion thereof is 
contacted with a test compound and the ability of the test compound to bind to the hVR- 
1, hVR-2, and rVR-2 protein or biologically active portion thereof is determined. 
20 biologically active portions of the hVR-1 , hVR-2, and rVR-2 proteins to be used in 

assays of the present invention include fragments which participate in interactions with 
non-hVR-1, non-hVR-2, and non-rVR-2 molecules, e.g., fragments with high surface 
probability scores. Binding of the test compound to the hVR-1, hVR-2, and rVR-2 
protein can be determined either directly or indirectly as described above. In a 
25 embodiment, the assay includes contacting the hVR-1, hVR-2, and rVR-2 protein or 

biologically active portion thereof with a known compound which binds hVR-1, hVR-2, 
and rVR-2 to form an assay mixture, contacting the assay mixture with a test compound, 
and determining the ability of the test compound to interact with an hVR-1, hVR-2, and 
rVR-2 protein, wherein determining the ability of the test compound to interact with an 
30 hVR- 1 , hVR-2, and rVR-2 protein comprises determining the ability of the test 
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compound to preferentially bind to hVR-1, hVR-2, and rVR-2 or biologically active 
portion thereof as compared to the known compound. 

In another embodiment, the assay is a cell-free assay in which an hVR-1, hVR-2, 
and rVR-2 protein or biologically active portion thereof is contacted with a test 
5 compound and the ability of the test compound to modulate (e.g., stimulate or inhibit) 
the activity of the hVR-1, hVR-2, and rVR-2 protein or biologically active portion 
thereof is determined. Determining the ability of the test compound to modulate the 
activity of an hVR-1, hVR-2, and rVR-2 protein can be accomplished, for example, by 
determining the ability of the hVR-1, hVR-2, and rVR-2 protein to bind to an hVR-1, 

10 hVR-2, and rVR-2 target molecule, e.g., a. vanilloid compound such as capsaicin, by one 
of the methods described above for determining direct binding. Determining the ability 
of the hVR-1 , hVR-2, and rVR-2 protein to bind to an hVR-1, hVR-2, and rVR-2 target 
molecule can also be accomplished using a technology such as real-time Biomolecular 
Interaction Analysis (BIA). Sjolander, S. and Urbaniczky, C. (1991) Anal. Chem. 

15 63:2338-2345 and Szabo et ah (1995) Curr. Opin. Struct. Biol 5:699-705. As used 
herein, "BIA" is a technology for studying biospecific interactions in real time, without 
labeling any of the interactants (e.g., BIAcore). Changes in the optical phenomenon of 
surface plasmon resonance (SPR) can be used as an indication of real-time reactions 
between biological molecules. 

20 In an alternative embodiment, determining the ability of the test compound to 

modulate the activity of an hVR-1, hVR-2, and rVR-2 protein can be accomplished by 
determining the ability of the hVR-1, hVR-2, and rVR-2 protein to further modulate the 
activity of a downstream effector of an hVR-1, hVR-2, and rVR-2 target molecule. For 
example, the activity of the effector molecule on an appropriate target can be determined 

25 or the binding of the effector to an appropriate target can be determined as previously 
described. 

In yet another embodiment, the cell-free assay involves contacting an hVR-1, 
hVR-2, and rVR-2 protein or biologically active portion thereof with a known 
compound which binds the hVR-1 , hVR-2, and rVR-2 protein to form an assay mixture, 
30 contacting the assay mixture with a test compound, and determining the ability of the 
test compound to interact with the hVR-1, hVR-2, and rVR-2 protein, wherein 
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determining the ability of the test compound to interact with the hVR-1, hVR-2, and 
rVR-2 protein comprises determining the ability of the hVR-1, hVR-2, and rVR-2 
protein to preferentially bind to or modulate the activity of an hVR-1, hVR-2, and rVR-2 
target molecule. 

5 The cell-free assays of the present invention are amenable to use of both soluble 

and/or membrane-bound forms of isolated proteins (e.g., hVR-1, hVR-2, and rVR-2 
proteins or biologically active portions thereof). In the case of cell-free assays in which 
a membrane-bound form of an isolated protein is used it may be desirable to utilize a 
solubilizing agent such that the membrane-bound form of the isolated protein is 
10 maintained in solution. Examples of such solubilizing agents include non-ionic 

detergents such as n-octylglucoside, n-dodecylglucoside, n-dodecylmaltoside, octanoyl- 
N-methylglucamide, decanoyl-N-methylglucamide, Triton® X-100, Triton® X-l 14, 
Thesit®, Isotridecypoly(ethylene glycol ether) n , 3 -[(3- 
cholamidopropyl)dimethylamminio]-l -propane sulfonate (CHAPS), 3-[(3- 
1 5 cholamidopropyl)dimethylamminio]-2-hydroxy- 1 -propane sulfonate (CHAPSO), or N- 
dodecyl=N,N-dimethy 1-3-ammonio- 1 -propane sulfonate. 

In more than one embodiment of the above assay methods of the present 
invention, it may be desirable to immobilize either hVR-1, hVR-2, and rVR-2 or its 
target molecule to facilitate separation of complexed from uncomplexed forms of one or 
20 both of the proteins, as well as to accommodate automation of the assay. Binding of a 
test compound to an hVR-1, hVR-2, and rVR-2 protein, or interaction of an hVR-1, 
hVR-2, and rVR-2 protein with a target molecule in the presence and absence of a 
candidate compound, can be accomplished in any vessel suitable for containing the 
reactants. Examples of such vessels include microtitre plates, test tubes, and micro- 
25 centrifuge tubes. In one embodiment, a fusion protein can be provided which adds a 
domain that allows one or both of the proteins to be bound to a matrix. For example, 
glutathione-S-transferase/ hVR-1, hVR-2, and rVR-2 fusion proteins or glutathione-S- 
transferase/target fusion proteins can be adsorbed onto glutathione sepharose beads 
(Sigma Chemical, St. Louis, MO) or glutathione derivatized microtitre plates, which are 
30 then combined with the test compound or the test compound and either the non-adsorbed 
target protein or hVR-1 , hVR-2, and rVR-2 protein, and the mixture incubated under 
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conditions conducive to complex formation (e.g., at physiological conditions for salt and 
pH). Following incubation, the beads or microtitre plate wells are washed to remove 
any unbound components, the matrix immobilized in the case of beads, complex 
determined either directly or indirectly, for example, as described above. Alternatively, 
5 the complexes can be dissociated from the matrix, and the level of hVR-1, hVR-2, and 
rVR-2 binding or activity determined using standard techniques. 

Other techniques for immobilizing proteins on matrices can also be used in the 
screening assays of the invention. For example, either an hVR-1, hVR-2, and rVR-2 
protein or an hVR-1, hVR-2, and rVR-2 target molecule can be immobilized utilizing 
10 conjugation of biotin and streptavidin. Biotinylated hVR-1, hVR-2, and rVR-2 protein 
or target molecules can be prepared from biotin-NHS (N-hydroxy-succinimide) using 
techniques known in the art (e.g., biotinylation kit, Pierce Chemicals, Rockford, IL), and 
immobilized in the wells of streptavidin-coated 96 well plates (Pierce Chemical). 
Alternatively, antibodies reactive with hVR-1, hVR-2, and rVR-2 protein or target 

15 molecules but which do not interfere with binding of the hVR-1, hVR-2, and rVR-2 
protein to its target molecule can be derivatized to the wells of the plate, and unbound 
target or hVR-1, hVR-2, and rVR-2 protein trapped in the wells by antibody 
conjugation. Methods for detecting such complexes, in addition to those described 
above for the GST-immobilized complexes, include immunodetection of complexes 

20 using antibodies reactive with the hVR-1 , hVR-2, and rVR-2 protein or target molecule, 
as well as enzyme-linked assays which rely on detecting an enzymatic activity 
associated with the hVR-1, hVR-2, and rVR-2 protein or target molecule. 

In another embodiment, modulators of hVR-1, hVR-2, and rVR-2 expression are 
identified in a method wherein a cell is contacted with a candidate compound and the 

25 expression of hVR-1, hVR-2, and rVR-2 mRNA or protein in the cell is determined. 
The level of expression of hVR-1, hVR-2, and rVR-2 mRNA or protein in the presence 
of the candidate compound is compared to the level of expression of hVR-1, hVR-2, and 
rVR-2 mRNA or protein in the absence of the candidate compound. The candidate 
compound can then be identified as a modulator of hVR-1, hVR-2, and rVR-2 

30 expression based on this comparison. For example, when expression of hVR-1, hVR-2, 
and rVR-2 mRNA or protein is greater (statistically significantly greater) in the presence 
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of the candidate compound than in its absence, the candidate compound is identified as a 
stimulator of hVR-1, hVR-2, and rVR-2 mRNA or protein expression. Alternatively, 
when expression of hVR-1, hVR-2, and rVR-2 mRNA or protein is less (statistically 
significantly less) in the presence of the candidate compound than in its absence, the 
5 candidate compound is identified as an inhibitor of hVR-1 , hVR-2, and rVR-2 mRNA or 
protein expression. The level of hVR-1, hVR-2, and rVR-2 mRNA or protein 
expression in the cells can be determined by methods described herein for detecting 
hVR-1, hVR-2, and rVR-2 mRNA or protein. 

In yet another aspect of the invention, the hVR-1, hVR-2, and rVR-2 proteins 
10 can be used as "bait proteins" in a two-hybrid assay or three-hybrid assay (see, e.g., U.S. 
Patent No. 5,283,3 17; Zervos et al (1993) Cell 72:223-232; Madura et al (1993) J. 
Biol Chem. 268:12046-12054; Bartel et al. (1993) Biotechniques 14:920-924; Iwabuchi 
etal (1993) Oncogene 8:1693-1696; and Brent WO94/10300), to identify other 
proteins, which bind to or interact with hVR-1, hVR-2, and rVR-2 ("hVR-1 -binding 
15 proteins", "hVR-2-binding proteins", and "rVR-2-binding proteins" or n hVR-l-bp", 
"hVR-2-bp", and "rVR-2-bp") and are involved in hVR-1, hVR-2, and rVR-2 activity. 
Such hVR-1, hVR-2, and rVR-2-binding proteins are also likely to be involved in the 
propagation of signals by the hVR-1, hVR-2, and rVR-2 proteins or hVR-1, hVR-2, and 
rVR-2 targets as, for example, downstream elements of an hVR-1, hVR-2, and rVR-2- 
20 mediated signaling pathway, e.g., a pain signaling pathway. Alternatively, such hVR-1, 
hVR-2, and rVR-2-binding proteins are likely to be hVR-1, hVR-2, and rVR-2 
inhibitors. 

The two-hybrid system is based on the modular nature of most transcription 
factors, which consist of separable DNA-binding and activation domains. Briefly, the 

25 assay utilizes two different DNA constructs. In one construct, the gene that codes for an 
hVR-1 , hVR-2, and rVR-2 protein is fused to a gene encoding the DNA binding domain 
of a known transcription factor (e.g., GAL-4). In the other construct, a DNA sequence, 
from a library of DNA sequences, that encodes an unidentified protein ("prey" or 
"sample") is fused to a gene that codes for the activation domain of the known 

30 transcription factor. If the "bait" and the "prey" proteins are able to interact, in vivo, 
forming an hVR-1, hVR-2, and rVR-2-dependent complex, the DNA-binding and 
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activation domains of the transcription factor are brought into close proximity. This 
proximity allows transcription of a reporter gene (e.g., LacZ) which is operably linked to 
a transcriptional regulatory site responsive to the transcription factor. Expression of the 
reporter gene can be detected and cell colonies containing the functional transcription 
5 factor can be isolated and used to obtain the cloned gene which encodes the protein 
which interacts with the hVR-1, hVR-2, and rVR-2 protein. 

This invention further pertains to novel agents identified by the above-described 
screening assays. Accordingly, it is within the scope of this invention to further use an 
agent identified as described herein in an appropriate animal model. For example, an 

1 0 agent identified as described herein (e.g. , an hVR-1 , hVR-2, and rVR-2 modulating 

agent, an antisense hVR-1, hVR-2, and rVR-2 nucleic acid molecule, an hVR-1, hVR-2, 
and rVR-2-specific antibody, or an hVR-1, hVR-2, and rVR-2-binding partner) can be 
used in an animal model to determine the efficacy, toxicity, or side effects of treatment 
with such an agent. Alternatively, an agent identified as described herein can be used in 

1 5 an animal model to determine the mechanism of action of such an agent. Furthermore, 
this invention pertains to uses of novel agents identified by the above-described 
screening assays for treatments as described herein. 

B. Detection Assays 

20 Portions or fragments of the cDNA sequences identified herein (and the 

corresponding complete gene sequences) can be used in numerous ways as 
polynucleotide reagents. For example, these sequences can be used to: (i) map their 
respective genes on a chromosome; and, thus, locate gene regions associated with 
genetic disease; (ii) identify an individual from a minute biological sample (tissue 

25 typing); and (iii) aid in forensic identification of a biological sample. These applications 
are described in the subsections below. 

1 . Chromosome Mapping 

Once the sequence (or a portion of the sequence) of a gene has been isolated, this 
30 sequence can be used to map the location of the gene on a chromosome. This process is 
called chromosome mapping. Accordingly, portions or fragments of the hVR-1, hVR-2, 
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and rVR-2 nucleotide sequences, described herein, can be used to map the location of 
the hVR-1 , hVR-2, and rVR-2 genes on a chromosome. The mapping of the hVR-1, 
hVR-2, and rVR-2 sequences to chromosomes is an important first step in correlating 
these sequences with genes associated with disease. 
5 Briefly, hVR-1 , hVR-2, and rVR-2 genes can be mapped to chromosomes by 

preparing PCR primers (preferably 15-25 bp in length) from the hVR-1, hVR-2, and 
rVR-2 nucleotide sequences. Computer analysis of the hVR-1, hVR-2, and rVR-2 
sequences can be used to predict primers that do not span more than one exon in the 
genomic DNA, thus complicating the amplification process. These primers can then be 
10 used for PCR screening of somatic cell hybrids containing individual human 

chromosomes. Only those hybrids containing the human gene corresponding to the 
hVR-L hVR-2, and rVR-2 sequences will yield an amplified fragment. 

Somatic cell hybrids are prepared by fusing somatic cells from different 
mammals {e.g., human and mouse cells). As hybrids of human and mouse cells grow 
15 and divide, they gradually lose human chromosomes in random order, but retain the 
mouse chromosomes. By using media in which mouse cells cannot grow, because they 
lack a particular enzyme, but human cells can, the one human chromosome that contains 
the gene encoding the needed enzyme, will be retained. By using various media, panels 
of hybrid cell lines can be established. Each cell line in a panel contains either a single 
20 human chromosome or a small number of human chromosomes, and a full set of mouse 
chromosomes, allowing easy mapping of individual genes to specific human 
chromosomes. (D'Eustachio P. et al (1983) Science 220:919-924). Somatic cell 
hybrids containing only fragments of human chromosomes can also be produced by 
using human chromosomes with translocations and deletions. 
25 PCR mapping of somatic cell hybrids is a rapid procedure for assigning a 

particular sequence to a particular chromosome. Three or more sequences can be 
assigned per day using a single thermal cycler. Using the hVR-1, hVR-2, and rVR-2 
nucleotide sequences to design oligonucleotide primers, sublocalization can be achieved 
with panels of fragments from specific chromosomes. Other mapping strategies which 
30 can similarly be used to map an hVR-1 , hVR-2, and rVR-2 sequence to its chromosome 
include in situ hybridization (described in Fan, Y. et al (1990) Proc. Natl. Acad ScL 
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USA, 87:6223-27), pre-screening with labeled flow-sorted chromosomes, and pre- 
selection by hybridization to chromosome specific cDNA libraries. 

Fluorescence in situ hybridization (FISH) of a DNA sequence to a metaphase 
chromosomal spread can further be used to provide a precise chromosomal location in 
5 one step^ Chromosome spreads can be made using cells whose division has been 

blocked in metaphase by a chemical such as colcemid that disrupts the mitotic spindle. 
The chromosomes can be treated briefly with trypsin, and then stained with Giemsa. A 
pattern of light and dark bands develops on each chromosome, so that the chromosomes 
can be identified individually. The FISH technique can be used with a DNA sequence 
10 as short as 500 or 600 bases. However, clones larger than 1 ? 000 bases have a higher 
likelihood of binding to a unique chromosomal location with sufficient signal intensity 
for simple detection. Preferably 1,000 bases, and more preferably 2,000 bases will 
suffice to get good results at a reasonable amount of time. For a review of this 
technique, see Verma^ a/., Human Chromosomes: A Manual of Basic Techniques 
1 5 (Pergamon Press, New York 1988). 

Reagents for chromosome mapping can be used individually to mark a single 
chromosome or a single site on that chromosome, or panels of reagents can be used for 
marking multiple sites and/or multiple chromosomes. Reagents corresponding to 
noncoding regions of the genes actually are for mapping purposes. Coding sequences 
20 are more likely to be conserved within gene families, thus increasing the chance of cross 
hybridizations during chromosomal mapping. 

Once a sequence has been mapped to a precise chromosomal location, the 
physical position of the sequence on the chromosome can be correlated with genetic map 
data. (Such data are found, for example, in V. McKusick, Mendelian Inheritance in 
25 Man, available on-line through Johns Hopkins University Welch Medical Library). The 
relationship between a gene and a disease, mapped to the same chromosomal region, can 
then be identified through linkage analysis (co-inheritance of physically adjacent genes), 
described in, for example, Egeland, J. et al (1987) Nature, 325:783-787. 

Moreover, differences in the DNA sequences between individuals affected and 
30 unaffected with a disease associated with the hVR-1, hVR-2, and rVR-2 gene, can be 
determined. If a mutation is observed in some or all of the affected individuals but not 
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in any unaffected individuals, then the mutation is likely to be the causative agent of the 
particular disease. Comparison of affected and unaffected individuals generally involves 
first looking for structural alterations in the chromosomes, such as deletions or 
translocations that are visible from chromosome spreads or detectable using PCR based 
5 on that DNA sequence. Ultimately, complete sequencing of genes from several 

individuals can be performed to confirm the presence of a mutation and to distinguish 
mutations from polymorphisms. 



2. Tissue Typing 

10 The hVR-1, hVR-2, and rVR-2 sequences of the present invention can also be 

used to identify individuals from minute biological samples. The United States military, 
for example, is considering the use of restriction fragment length polymorphism (RFLP) 
for identification of its personnel. In this technique, an individual's genomic DNA is 
digested with one or more restriction enzymes, and probed on a Southern blot to yield 
1 5 unique bands for identification. This method does not suffer from the current limitations 
of "Dog Tags" which can be lost, switched, or stolen, making positive identification 
difficult. The sequences of the present invention are useful as additional DNA markers 
for RFLP (described in U.S. Patent 5,272,057). 

Furthermore, the sequences of the present invention can be used to provide an 

20 alternative technique which determines the actual base-by-base DNA sequence of 
selected portions of an individual's genome. Thus, the hVR-1, hVR-2, and rVR-2 
nucleotide sequences described herein can be used to prepare two PCR primers from the 
5' and 3' ends of the sequences. These primers can then be used to amplify an 
individual's DNA and subsequently sequence it. 

25 Panels of corresponding DNA sequences from individuals, prepared in this 

manner, can provide unique individual identifications, as each individual will have a 
unique set of such DNA sequences due to allelic differences. The sequences of the 
present invention can be used to obtain such identification sequences from individuals 
and from tissue. The hVR-1, hVR-2, and rVR-2 nucleotide sequences of the invention 

30 uniquely represent portions of the human genome. Allelic variation occurs to some 

degree in the coding regions of these sequences, and to a greater degree in the noncoding 
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regions. It is estimated that allelic variation between individual humans occurs with a 
frequency of about once per each 500 bases. Each of the sequences described herein 
can, to some degree, be used as a standard against which DNA from an individual can be 
compared for identification purposes. Because greater numbers of polymorphisms occur 
5 in the noncoding regions, fewer sequences are necessary to differentiate individuals. 

If a panel of reagents from hVR-1, hVR-2, and rVR-2 nucleotide sequences 
described herein is used to generate a unique identification database for an individual, 
those same reagents can later be used to identify tissue from that individual. Using the 
unique identification database, positive identification of the individual, living or dead, 
10 can be made from extremely small tissue samples. 

3. Use of Partial hVR-1, hVR-2, and rVR-2 Sequences in Forensic Biology 
DNA-based identification techniques can also be used in forensic biology. 
Forensic biology is a scientific field employing genetic typing of biological evidence 
1 5 found at a crime scene as a means for positively identifying, for example, a perpetrator 
of a crime. To make such an identification, PCR technology can be used to amplify 
DNA sequences taken from very small biological samples such as tissues, e.g., hair or 
skin, or body fluids, e.g., blood, saliva, or semen found at a crime scene. The amplified 
sequence can then be compared to a standard, thereby allowing identification of the 
20 origin of the biological sample. 

The sequences of the present invention can be used to provide polynucleotide 
reagents, e.g., PCR primers, targeted to specific loci in the human genome, which can 
enhance the reliability of DNA-based forensic identifications by, for example, providing 
another "identification marker" (i.e. another DNA sequence that is unique to a particular 
25 individual). As mentioned above, actual base sequence information can be used for 
identification as an accurate alternative to patterns formed by restriction enzyme 
generated fragments. Examples of polynucleotide reagents include the hVR-1, hVR-2, 
and rVR-2 nucleotide sequences or portions thereof, e.g., fragments derived from SEQ 
ID NO:l, 3, 4, 6, 7, 9, 10, or 1 1 having a length of at least 20 bases, preferably at least 
30 30 bases. 



BNSDOCID: <WO 0O29577A1_IA> 



WO 00/29577 PCT/US99/26701 

- 72 - 

The hVR-1, hVR-2, and rVR-2 nucleotide sequences described herein can further 
be used to provide polynucleotide reagents, e.g., labeled or labelable probes which can 
be used in, for example, an in situ hybridization technique, to identify a specific tissue, 
e.g., brain tissue. This can be very useful in cases where a forensic pathologist is 
5 presented with a tissue of unknown origin. Panels of such hVR-1 , hVR-2, and rVR-2 
probes can be used to identify tissue by species and/or by organ type. 

In a similar fashion, these reagents, e.g., hVR-1, hVR-2 ? and rVR-2 primers or 
probes can be used to screen tissue culture for contamination (i.e. screen for the presence 
of a mixture of different types of cells in a culture). 

10 

C. Predictive Medicine : 

The present invention also pertains to the field of predictive medicine in which 
diagnostic assays, prognostic assays, and monitoring clinical trials are used for 
prognostic (predictive) purposes to thereby treat an individual prophylactically. 

1 5 Accordingly, one aspect of the present invention relates to diagnostic assays for 

determining hVR-1 , hVR-2, and rVR-2 protein and/or nucleic acid expression as well as 
hVR-1 , hVR-2, and rVR-2 activity, in the context of a biological sample (e.g, blood, 
serum, cells, tissue) to thereby determine whether an individual is afflicted with a 
disease or disorder, or is at risk of developing a disorder, associated with aberrant hVR- 

20 1 , hVR-2, and rVR-2 expression or activity. The invention also provides for prognostic 
(or predictive) assays for determining whether an individual is at risk of developing a 
disorder associated with hVR-1, hVR-2, and rVR-2 protein, nucleic acid expression or 
activity. For example, mutations in an hVR-1, hVR-2, and rVR-2 gene can be assayed 
in a biological sample. Such assays can be used for prognostic or predictive purpose to 

25 thereby phophylactically treat an individual prior to the onset of a disorder characterized 
by or associated with hVR-1, hVR-2, and rVR-2 protein, nucleic acid expression or 
activity. 

Another aspect of the invention pertains to monitoring the influence of agents 
(e.g., drugs, compounds) on the expression or activity of hVR-1, hVR-2, and rVR-2 in 
30 clinical trials. 

These and other agents are described in further detail in the following sections. 
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I. Diagnostic Assays 

An exemplary method for detecting the presence or absence of hVR-1, hVR-2, 
and rVR-2 protein or nucleic acid in a biological sample involves obtaining a biological 
sample from a test subject and contacting the biological sample with a compound or an 
5 agent capable of detecting hVR-1, hVR-2, and rVR-2 protein or nucleic acid (e.g., 
mRNA, genomic DNA) that encodes hVR-1, hVR-2, and rVR-2 protein such that the 
presence of hVR-1, hVR-2, and rVR-2 protein or nucleic acid is detected in the 
biological sample. A agent for detecting hVR-1, hVR-2, and rVR-2 mRNA or genomic 
DNA is a labeled nucleic acid probe capable of hybridizing to hVR-1, hVR-2, and rVR- 
10 2 mRNA or genomic DNA. The nucleic acid probe can be, for example, a full-length 
hVR-1 , hVR-2, and rVR-2 nucleic acid, such as the nucleic acid of SEQ ID NO:l, 3, 4, 
6, 7, 9, 10, or 12, or a portion thereof, such as an oligonucleotide of at least 15, 30, 50, 
100, 250 or 500 nucleotides in length and sufficient to specifically hybridize under 
stringent conditions to hVR-1, hVR-2, and rVR-2 mRNA or genomic DNA. Other 
1 5 suitable probes for use in the diagnostic assays of the invention are described herein. 

An agent for detecting hVR-1, hVR-2, and rVR-2 protein is an antibody capable 
of binding to hVR-1, hVR-2, and rVR-2 protein, preferably an antibody with a 
detectable label. Antibodies can be polyclonal, or more preferably, monoclonal. An 
intact antibody, or a fragment thereof (e.g., Fab or F(ab')2) can be used. The term 
20 "labeled", with regard to the probe or antibody, is intended to encompass direct labeling 
of the probe or antibody by coupling (i.e., physically linking) a detectable substance to 
the probe or antibody, as well as indirect labeling of the probe or antibody by reactivity 
with another reagent that is directly labeled. Examples of indirect labeling include 
detection of a primary antibody using a fluorescently labeled secondary antibody and 
25 end-labeling of a DNA probe with biotin such that it can be detected with fluorescently 
labeled streptavidin. The term "biological sample" is intended to include tissues, cells 
and biological fluids isolated from a subject, as well as tissues, cells and fluids present 
within a subject. That is, the detection method of the invention can be used to detect 
hVR-1, hVR-2, and rVR-2 mRNA, protein, or genomic DNA in a biological sample in 
30 vitro as well as in vivo. For example, in vitro techniques for detection of hVR-1, hVR-2, 
and rVR-2 mRNA include Northern hybridizations and in situ hybridizations. In vitro 
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techniques for detection of hVR-1, hVR-2, and rVR-2 protein include enzyme linked 
immunosorbent assays (ELISAs), Western blots, immunoprecipitations and 
immunofluorescence. In vitro techniques for detection of hVR-1, hVR-2, and rVR-2 
genomic DNA include Southern hybridizations. Furthermore, in vivo techniques for 
5 detection of h VR- 1 , hVR-2, and rVR-2 protein include introducing into a subject a 
labeled anti-hVR-1, hVR-2, and rVR-2 antibody. For example, the antibody can be 
labeled with a radioactive marker whose presence and location in a subject can be 
detected by standard imaging techniques. 

In one embodiment, the biological sample contains protein molecules from the 
10 test subject. Alternatively, the biological sample can contain mRNA molecules from the 
test subject or genomic DNA molecules from the test subject. A biological sample is a 
serum sample isolated by conventional means from a subject. 

In another embodiment, the methods further involve obtaining a control 
biological sample from a control subject, contacting the control sample with a 
1 5 compound or agent capable of detecting hVR-1 , hVR-2, and rVR-2 protein, mRNA, or 
genomic DNA, such that the presence of hVR-1, hVR-2, and rVR-2 protein, mRNA or 
genomic DNA is detected in the biological sample, and comparing the presence of hVR- 

1, hVR-2, and rVR-2 protein, mRNA or genomic DNA in the control sample with the 
presence of hVR-1, hVR-2, and rVR-2 protein, mRNA or genomic DNA in the test 

20 sample. 

The invention also encompasses kits for detecting the presence of hVR-1, hVR- 

2, and rVR-2 in a biological sample. For example, the kit can comprise a labeled 
compound or agent capable of detecting hVR-1, hVR-2, and rVR-2 protein or mRNA in 
a biological sample; means for determining the amount of hVR-1, hVR-2, and rVR-2 in 

25 the sample; and means for comparing the amount of hVR-1, hVR-2, and rVR-2 in the 
sample with a standard. The compound or agent can be packaged in a suitable container. 
The kit can further comprise instructions for using the kit to detect hVR-1, hVR-2, and 
rVR-2 protein or nucleic acid. 
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2. Prognostic Assays 

The diagnostic methods described herein can furthermore be utilized to identify 
subjects having or at risk of developing a disease or disorder associated with aberrant 
hVR-1, hVR-2, and rVR-2 expression or activity. As used herein, the term "aberrant" 
5 includes an hVR-1, hVR-2, and rVR-2 expression or activity which deviates from the 
wild type hVR-1, hVR-2, and rVR-2 expression or activity. Aberrant expression or 
activity includes increased or decreased expression or activity, as well as expression or 
activity which does not follow the wild type developmental pattern of expression or the 
subcellular pattern of expression. For example, aberrant hVR-1, hVR-2, and rVR-2 

10 expression or activity is intended to include the cases in which a mutation in the hVR-1, 
hVR-2, and rVR-2 gene causes the hVR-1, hVR-2, and rVR-2 gene to be under- 
expressed or over-expressed and situations in which such mutations result in a non- 
functional hVR-1, hVR-2, and rVR-2 protein or a protein which does not function in a 
wild-type fashion, e.g., a protein which does not interact with an hVR-1, hVR-2, and 

15 rVR-2 ligand or one which interacts with a non-hVR-1, non-hVR-2, and non-rVR-2 
ligand. 

The assays described herein, such as the preceding diagnostic assays or the 
following assays, can be utilized to identify a subject having or at risk of developing a 
disorder associated with a misregulation in hVR-1, hVR-2, and rVR-2 protein activity or 

20 nucleic acid expression, such as a pain disorder. Alternatively, the prognostic assays can 
be utilized to identify a subject having or at risk for developing a disorder associated 
with a misregulation in hVR-1, hVR-2, and rVR-2 protein activity or nucleic acid 
expression, such as a pain disorder. Thus, the present invention provides a method for 
identifying a disease or disorder associated with aberrant hVR-1, hVR-2, and rVR-2 

25 expression or activity in which a test sample is obtained from a subject and hVR-1, 
hVR-2, and rVR-2 protein or nucleic acid (e.g., mRNA or genomic DNA) is detected, 
wherein the presence of hVR-1, hVR-2, and rVR-2 protein or nucleic acid is diagnostic 
for a subject having or at risk of developing a disease or disorder associated with 
aberrant hVR-1 , hVR-2, and rVR-2 expression or activity. As used herein, a "test 

30 sample" refers to a biological sample obtained from a subject of interest. For example, a 
test sample can be a biological fluid (e.g., serum), cell sample, or tissue. 
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Furthermore, the prognostic assays described herein can be used to determine 
whether a subject can be administered an agent (e.g., an agonist, antagonist, 
peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drug candidate) 
to treat a disease or disorder associated with aberrant hVR-1, hVR-2, and rVR-2 
5 expression or activity. For example, such methods can be used to determine whether a 
subject can be effectively treated with an agent for a pain disorder. Thus, the present 
invention provides methods for determining whether a subject can be effectively treated 
with an agent for a disorder associated with aberrant hVR-1, hVR-2, and rVR-2 
expression or activity in which a test sample is obtained and hVR-1, hVR-2, and rVR-2 
1 0 protein or nucleic acid expression or activity is detected (e.g. , wherein the abundance of 
hVR-1, hVR-2, and rVR-2 protein or nucleic acid expression or activity is diagnostic for 
a subject that can be administered the agent to treat a disorder associated with aberrant 
hVR-L hVR-2, and rVR-2 expression or activity). 

The methods of the invention can also be used to detect genetic alterations in an 
15 hVR-1 , hVR-2, and rVR-2 gene, thereby determining if a subject with the altered gene is 
at risk for a disorder characterized by misregulation in hVR-1, hVR-2, and rVR-2 
protein activity or nucleic acid expression, such as a neurodegenerative disorder. In 
embodiments, the methods include detecting, in a sample of cells from the subject, the 
presence or absence of a genetic alteration characterized by at least one of an alteration 
20 affecting the integrity of a gene encoding an hVR- 1 , hVR-2, and rVR-2-protein, or the 
mis-expression of the hVR-1, hVR-2, and rVR-2 gene. For example, such genetic 
alterations can be detected by ascertaining the existence of at least one of 1) a deletion of 
one or more nucleotides from an hVR-1, hVR-2, and rVR-2 gene; 2) an addition of one 
or more nucleotides to an hVR-1, hVR-2, and rVR-2 gene; 3) a substitution of one or 
25 more nucleotides of an hVR-1 , hVR-2, and rVR-2 gene, 4) a chromosomal 

rearrangement of an hVR-1, hVR-2, and rVR-2 gene; 5) an alteration in the level of a 
messenger RNA transcript of an hVR-1, hVR-2, and rVR-2 gene, 6) aberrant 
modification of an hVR-1, hVR-2, and rVR-2 gene, such as of the methylation pattern of 
the genomic DNA, 7) the presence of a non-wild type splicing pattern of a messenger 
30 RNA transcript of an hVR-1, hVR-2, and rVR-2 gene, 8) a non-wild type level of an 
hVR-1, hVR-2, and rVR-2-protein, 9) allelic loss of an hVR-1, hVR-2, and rVR-2 gene, 
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and 10) inappropriate post-translational modification of an hVR-1, hVR-2, and rVR-2- 
protein. As described herein, there are a large number of assays known in the art which 
can be used for detecting alterations in an hVR-1, hVR-2, and rVR-2 gene. A biological 
sample is a tissue or serum sample isolated by conventional means from a subject. 
5 In certain embodiments, detection of the alteration involves the use of a 

probe/primer in a polymerase chain reaction (PCR) (see, e.g., U.S. Patent Nos. 
4,683,195 and 4,683,202), such as anchor PCR or RACE PCR, or, alternatively, in a 
ligation chain reaction (LCR) (see, e.g., Landegran et al. (1988) Science 241 : 1077- 1080; 
and Nakazawa et al. (1994) Proc. Natl Acad. Sci. USA 91:360-364), the latter of which 
10 can be particularly useful for detecting point mutations in the hVR-1 , hVR-2, and rVR- 
2-gene (see Abravaya et al (1995) Nucleic Acids Res .23:675-682). This method can 
include the steps of collecting a sample of cells from a subject, isolating nucleic acid 
(e.g., genomic, mRNA or both) from the cells of the sample, contacting the nucleic acid 
sample with one or more primers which specifically hybridize to an hVR-1, hVR-2, and 
15 rVR-2 gene under conditions such that hybridization and amplification of the hVR-1, 
hVR-2, and rVR-2-gene (if present) occurs, and detecting the presence or absence of an 
amplification product, or detecting the size of the amplification product and comparing 
the length to a control sample. It is anticipated that PCR and/or LCR may be desirable 
to use as a preliminary amplification step in conjunction with any of the techniques used 
20 for detecting mutations described herein. 

Alternative amplification methods include: self sustained sequence replication 
(Guatelli, J.C. et al, (1990) Proc. Natl Acad. Sci. USA 87:1874-1878), transcriptional 
amplification system (Kwoh, D.Y. et al, (1989) Proc. Natl Acad. Sci. USA 86:1 173- 
1 177), Q-Beta Replicase (Lizardi, P.M. et al (1988) Bio-Technology 6:1 197), or any 
25 other nucleic acid amplification method, followed by the detection of the amplified 
molecules using techniques well known to those of skill in the art. These detection 
schemes are especially useful for the detection of nucleic acid molecules if such 
molecules are present in very low numbers. 

In an alternative embodiment, mutations in an hVR-1, hVR-2, and rVR-2 gene 
30 from a sample cell can be identified by alterations in restriction enzyme cleavage 
patterns. For example, sample and control DNA is isolated, amplified (optionally), 
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digested with one or more restriction endonucleases, and fragment length sizes are 
determined by gel electrophoresis and compared. Differences in fragment length sizes 
between sample and control DNA indicates mutations in the sample DNA. Moreover, 
the use of sequence specific ribozymes (see, for example, U.S. Patent No. 5,498,531) 
5 can be used to score for the presence of specific mutations by development or loss of a 
ribozyme cleavage site. 

In other embodiments, genetic mutations in hVR-1, hVR-2, and rVR-2 can be 
identified by hybridizing a sample and control nucleic acids, e.g., DNA or RNA, to high 
density arrays containing hundreds or thousands of oligonucleotides probes (Cronin, 
10 M.T. et al (1996) Human Mutation 7: 244-255; Kozal, M.J. et ai (1996) Nature 

Medicine 2: 753-759). For example, genetic mutations in hVR-1 , hVR-2, and rVR-2 
can be identified in two dimensional arrays containing light-generated DNA probes as 
described in Cronin, M.T. et ai supra. Briefly, a first hybridization array of probes can 
be used to scan through long stretches of DNA in a sample and control to identify base 
1 5 changes between the sequences by making linear arrays of sequential overlapping 

probes. This step allows the identification of point mutations. This step is followed by 
a second hybridization array that allows the characterization of specific mutations by 
using smaller, specialized probe arrays complementary to all variants or mutations 
detected. Each mutation array is composed of parallel probe sets, one complementary to 
20 the wild-type gene and the other complementary to the mutant gene. 

In yet another embodiment, any of a variety of sequencing reactions known in 
the art can be used to directly sequence the hVR-1, hVR-2, and rVR-2 gene and detect 
mutations by comparing the sequence of the sample hVR-1, hVR-2, and rVR-2 with the 
corresponding wild-type (control) sequence. Examples of sequencing reactions include 
25 those based on techniques developed by Maxam and Gilbert ((1977) Proc. Natl Acad. 
ScL USA 74:560) or Sanger ((1977) Proc. Natl Acad. ScL USA 74:5463). It is also 
contemplated that any of a variety of automated sequencing procedures can be utilized 
when performing the diagnostic assays ((1995) Biotechniques 19:448), including 
sequencing by mass spectrometry (see, e.g., PCT International Publication No. WO 
30 94/16101; Cohen et al (1996) Adv. Chromatogr. 36:127-162; and Griffin et ai (1993) 
Appl. Biochem. BiotechnoL 38:147-159). 
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Other methods for detecting mutations in the hVR-1, hVR-2, and rVR-2 gene 
include methods in which protection from cleavage agents is used to detect mismatched 
bases in RNA/RNA or RNA/DNA heteroduplexes (Myers et al (1985) Science 
230:1242). In general, the art technique of "mismatch cleavage" starts by providing 
5 heteroduplexes of formed by hybridizing (labeled) RNA or DNA containing the wild- 
type hVR-1, hVR-2, and rVR-2 sequence with potentially mutant RNA or DNA 
obtained from a tissue sample. The double-stranded duplexes are treated with an agent 
which cleaves single-stranded regions of the duplex such as which will exist due to 
basepair mismatches between the control and sample strands. For instance, RNA/DNA 

1 0 duplexes can be treated with RNase and DN A/DNA hybrids treated with S 1 nuclease to 
enzymatically digesting the mismatched regions. In other embodiments, either 
DNA/DNA or RNA/DNA duplexes can be treated with hydroxylamine or osmium 
tetroxide and with piperidine in order to digest mismatched regions. After digestion of 
the mismatched regions, the resulting material is then separated by size on denaturing 

15 polyacrylamide gels to determine the site of mutation. See, for example, Cotton et al 
(1988) Proc. Natl Acad Sci USA 85:4397; Saleeba et al (1992) Methods Enzymol 
217:286-295. In a embodiment, the control DNA or RNA can be labeled for detection. 

In still another embodiment, the mismatch cleavage reaction employs one or 
more proteins that recognize mismatched base pairs in double-stranded DNA (so called 

20 "DNA mismatch repair" enzymes) in defined systems for detecting and mapping point 
mutations in hVR-1, hVR-2, and rVR-2 cDNAs obtained from samples of cells. For 
example, the mutY enzyme of E. coli cleaves A at G/A mismatches and the thymidine 
DNA glycosylase from HeLa cells cleaves T at G/T mismatches (Hsu et al (1994) 
Carcinogenesis 15:1657-1662). According to an exemplary embodiment, a probe based 

25 on an hVR-1, hVR-2, and rVR-2 sequence, e.g., a wild-type hVR-1, hVR-2, and rVR-2 
sequence, is hybridized to a cDNA or other DNA product from a test cell(s). The duplex 
is treated with a DNA mismatch repair enzyme, and the cleavage products, if any, can be 
detected from electrophoresis protocols or the like. See, for example, U.S. Patent No. 
5,459,039. 
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In other embodiments, alterations in electrophoretic mobility will be used to 
identify mutations in hVR-1, hVR-2, and rVR-2 genes. For example, single strand 
conformation polymorphism (SSCP) may be used to detect differences in electrophoretic 
mobility between mutant and wild type nucleic acids (orita et al (1989) Proc Natl 
5 Acad. Sci USA: 86:2766, see also Cotton (1993) MutaL Res. 285:125-144; and Hayashi 
(1992) Genet. Anal. Tech. Appl. 9:73-79). Single-stranded DNA fragments of sample 
and control hVR-1, hVR-2, and rVR-2 nucleic acids will be denatured and allowed to 
renature. The secondary structure of single-stranded nucleic acids varies according to 
sequence, the resulting alteration in electrophoretic mobility enables the detection of 
10 even a single base change. The DNA fragments may be labeled or detected with labeled 
probes. The sensitivity of the assay may be enhanced by using RNA (rather than DNA), 
in which the secondary structure is more sensitive to a change in sequence. In a 
embodiment, the subject method utilizes heteroduplex analysis to separate double 
stranded heteroduplex molecules on the basis of changes in electrophoretic mobility 
15 (Keen et ai (1991) Trends Genet 7:5). 

In yet another embodiment the movement of mutant or wild-type fragments in 
polyacrylamide gels containing a gradient of denaturant is assayed using denaturing 
gradient gel electrophoresis (DGGE) (Myers et al. (1985) Nature 313:495). When 
DGGE is used as the method of analysis, DNA will be modified to insure that it does not 
20 completely denature, for example by adding a GC clamp of approximately 40 bp of 

high-melting GC-rich DNA by PCR. In a further embodiment, a temperature gradient is 
used in place of a denaturing gradient to identify differences in the mobility of control 
and sample DNA (Rosenbaum and Reissner (1987) Biophys Chem 265:12753). 

Examples of other techniques for detecting point mutations include, but are not 
25 limited to, selective oligonucleotide hybridization, selective amplification, or selective 
primer extension. For example, oligonucleotide primers may be prepared in which the 
known mutation is placed centrally and then hybridized to target DNA under conditions 
which permit hybridization only if a perfect match is found (Saiki et al. (1986) Nature 
324:163); Saiki et al (1989) Proc. Natl Acad. Sci USA 86:6230). Such allele specific 
30 oligonucleotides are hybridized to PCR amplified target DNA or a number of different 
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mutations when the oligonucleotides are attached to the hybridizing membrane and 
hybridized with labeled target DNA. 

Alternatively, allele specific amplification technology which depends on selective 
PCR amplification may be used in conjunction with the instant invention. 
5 Oligonucleotides used as primers for specific amplification may carry the mutation of 
interest in the center of the molecule (so that amplification depends on differential 
hybridization) (Gibbs et al (1989) Nucleic Acids Res. 17:2437-2448) or at the extreme 3* 
end of one primer where, under appropriate conditions, mismatch can prevent, or reduce 
polymerase extension (Prossner (1993) Tibtech 1 1 :238). In addition it may be desirable 

1 0 to introduce a novel restriction site in the region of the mutation to create cleavage-based 
detection (Gasparini et al. (1992) MoL Cell Probes 6:1). It is anticipated that in certain 
embodiments amplification may also be performed using Taq ligase for amplification 
(Barany (1991) Proc. Natl. Acad. Sci USA 88:189). In such cases, ligation will occur 
only if there is a perfect match at the 3' end of the 5 ! sequence making it possible to detect 

15 the presence of a known mutation at a specific site by looking for the presence or absence 
of amplification. 

The methods described herein may be performed, for example, by utilizing pre- 
packaged diagnostic kits comprising at least one probe nucleic acid or antibody reagent 
described herein, which may be conveniently used, e.g., in clinical settings to diagnose 
20 patients exhibiting symptoms or family history of a disease or illness involving an hVR- 
1, hVR-2, and rVR-2 gene. 

Furthermore, any cell type or tissue in which hVR-1, hVR-2, and rVR-2 is 
expressed may be utilized in the prognostic assays described herein. 

25 3. Monitoring of Effects During Clinical Trials 

Monitoring the influence of agents (e.g., drugs) on the expression or activity of 
an hVR-1, hVR-2, and rVR-2 protein can be applied not only in basic drug screening, 
but also in clinical trials. For example, the effectiveness of an agent determined by a 
screening assay as described herein to increase hVR-1, hVR-2, and rVR-2 gene 

30 expression, protein levels, or upregulate hVR-1 , hVR-2, and rVR-2 activity, can be 
monitored in clinical trials of subjects exhibiting decreased hVR-1, hVR-2, and rVR-2 
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gene expression, protein levels, or downregulated hVR-1, hVR-2, and rVR-2 activity. 
Alternatively, the effectiveness of an agent determined by a screening assay to decrease 
hVR-1, hVR-2, and rVR-2 gene expression, protein levels, or downregulate hVR-1, 
hVR-2, and rVR-2 activity, can be monitored in clinical trials of subjects exhibiting 
5 increased hVR- 1 , hVR-2, and rVR-2 gene expression, protein levels, or upregulated 
hVR-1, hVR-2, and rVR-2 activity. In such clinical trials, the expression or activity of 
an hVR-1, hVR-2, and rVR-2 gene, and preferably, other genes that have been 
implicated in, for example, an hVR-1, hVR-2, and rVR-2-associated disorder can be 
used as a "read out H or markers of the phenotype of a particular cell. 
1 0 For example, and not by way of limitation, genes, including hVR- 1 , hVR-2, and 

rVR-2, that are modulated in cells by treatment with an agent (e.g., compound, drug or 
small molecule) which modulates hVR-1, hVR-2, and rVR-2 activity (e.g., identified in 
a screening assay as described herein) can be identified. Thus, to study the effect of 
agents on hVR-1, hVR-2, and rVR-2-associated disorders (e.g., pain disorders), for 
1 5 example, in a clinical trial, cells can be isolated and RNA prepared and analyzed for the 
levels of expression of hVR-1, hVR-2, and rVR-2 and other genes implicated in the 
hVR-1, hVR-2, and rVR-2-associated disorder, respectively. The levels of gene 
expression (e.g, a gene expression pattern) can be quantified by northern blot analysis 
or RT-PCR, as described herein, or alternatively by measuring the amount of protein 
20 produced, by one of the methods as described herein, or by measuring the levels of 
activity of hVR-1, hVR-2, and rVR-2 or other genes. In this way, the gene expression 
pattern can serve as a marker, indicative of the physiological response of the cells to the 
agent. Accordingly, this response state may be determined before, and at various points 
during treatment of the individual with the agent. 
25 I n a embodiment, the present invention provides a method for monitoring the 

effectiveness of treatment of a subject with an agent (e.g., an agonist, antagonist, 
peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drug candidate 
identified by the screening assays described herein) including the steps of (i) obtaining a 
pre-administration sample from a subject prior to administration of the agent; (ii) 
30 detecting the level of expression of an hVR-1, hVR-2, and rVR-2 protein, mRNA, or 
genomic DNA in the preadministration sample; (iii) obtaining one or more post- 
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administration samples from the subject; (iv) detecting the level of expression or activity 
of the hVR-1, hVR-2, and rVR-2 protein, mRNA, or genomic DNA in the post- 
administration samples; (v) comparing the level of expression or activity of the hVR-1, 
hVR-2, and rVR-2 protein, mRNA, or genomic DNA in the pre-administration sample 
5 with the hVR-1, hVR-2, and rVR-2 protein, mRNA, or genomic DNA in the post 

administration sample or samples; and (vi) altering the administration of the agent to the 
subject accordingly. For example, increased administration of the agent may be 
desirable to increase the expression or activity of hVR-1, hVR-2, and rVR-2 to higher 
levels than detected, i.e., to increase the effectiveness of the agent. Alternatively, 
1 0 decreased administration of the agent may be desirable to decrease expression or activity 
of hVR-1, hVR-2, and rVR-2 to lower levels than detected, i.e. to decrease the 
effectiveness of the agent. According to such an embodiment, hVR-1, hVR-2, and rVR- 
2 expression or activity may be used as an indicator of the effectiveness of an agent, 
even in the absence of an observable phenotypic response. 

15 

D. Methods of Treatment : 

The present invention provides for both prophylactic and therapeutic methods of 
treating a subject at risk of (or susceptible to) a disorder or having a disorder associated 
with aberrant hVR-1, hVR-2, and rVR-2 expression or activity. With regards to both 

20 prophylactic and therapeutic methods of treatment, such treatments may be specifically 
tailored or modified, based on knowledge obtained from the field of pharmacogenomics. 
M Pharmacogenomics ,, , as used herein, refers to the application of genomics technologies 
such as gene sequencing, statistical genetics, and gene expression analysis to drugs in 
clinical development and on the market. More specifically, the term refers the study of 

25 how a patients genes determine his or her response to a drug {e.g., a patient's "drug 
response phenotype", or "drug response genotype".) Thus, another aspect of the 
invention provides methods for tailoring an individual's prophylactic or therapeutic 
treatment with either the hVR-1, hVR-2, and rVR-2 molecules of the present invention 
or hVR-1, hVR-2, and rVR-2 modulators according to that individual's drug response 

30 genotype. Pharmacogenomics allows a clinician or physician to target prophylactic or 
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therapeutic treatments to patients who will most benefit from the treatment and to avoid 
treatment of patients who will experience toxic drug-related side effects. 

1. Prophylactic Methods 
5 In one aspect, the invention provides a method for preventing in a subject, a 

disease or condition associated with an aberrant hVR-1, hVR-2, and rVR-2 expression 
or activity, by administering to the subject an hVR-1, hVR-2, and rVR-2 or an agent 
which modulates hVR-1, hVR-2, and rVR-2 expression or at least one hVR-1, hVR-2, 
and rVR-2 activity. Subjects at risk for a disease which is caused or contributed to by 

1 0 aberrant hVR- 1 , hVR-2, and rVR-2 expression or activity can be identified by, for 
example, any or a combination of diagnostic or prognostic assays as described herein. 
Administration of a prophylactic agent can occur prior to the manifestation of symptoms 
characteristic of the hVR-1, hVR-2, and rVR-2 aberrancy, such that a disease or disorder 
is prevented or, alternatively, delayed in its progression. Depending on the type of hVR- 

15 1 , hVR-2, and rVR-2 aberrancy, for example, an h VR- 1 , hVR-2, and rVR-2, hVR- 1 , 
hVR-2, and rVR-2 agonist or hVR-1, hVR-2, and rVR-2 antagonist agent can be used 
for treating the subject. The appropriate agent can be determined based on screening 
assays described herein. 

20 2. Therapeutic Methods 

Another aspect of the invention pertains to methods of modulating hVR- 1 , hVR- 
2, and rVR-2 expression or activity for therapeutic purposes. Accordingly, in an 
exemplary embodiment, the modulatory method of the invention involves contacting a 
cell with an hVR-1, hVR-2, and rVR-2 or agent that modulates one or more of the 

25 activities of hVR-1, hVR-2, and rVR-2 protein activity associated with the cell. An 
agent that modulates hVR-1, hVR-2, and rVR-2 protein activity can be an agent as 
described herein, such as a nucleic acid or a protein, a naturally-occurring target 
molecule of an hVR-1, hVR-2, and rVR-2 protein (e.g., an hVR-1, hVR-2, and rVR-2 
substrate), an hVR-1, hVR-2, and rVR-2 antibody, an hVR-1, hVR-2, and rVR-2 agonist 

30 or antagonist, a peptidomimetic of an hVR-1 , hVR-2, and rVR-2 agonist or antagonist, 
or other small molecule. In one embodiment, the agent stimulates one or more hVR-1, 
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hVR-2, and rVR-2 activities. Examples of such stimulatory agents include active hVR- 

1, hVR-2, and rVR-2 protein and a nucleic acid molecule encoding hVR-1, hVR-2, and 
rVR-2 that has been introduced into the cell. In another embodiment, the agent inhibits 
one or more hVR-1, hVR-2, and rVR-2 activities. Examples of such inhibitory agents 

5 include antisense hVR-1, hVR-2, and rVR-2 nucleic acid molecules, anti-hVR-1, hVR- 

2, and rVR-2 antibodies, and hVR-1, hVR-2, and rVR-2 inhibitors. These modulatory 
methods can be performed in vitro (e.g., by culturing the cell with the agent) or, 
alternatively, in vivo (e.g., by administering the agent to a subject). As such, the present 
invention provides methods of treating an individual afflicted with a disease or disorder 

1 0 characterized by aberrant expression or activity of an hVR-1 , hVR-2, and rVR-2 protein 
or nucleic acid molecule. In one embodiment, the method involves administering an 
agent (e.g., an agent identified by a screening assay described herein), or combination of 
agents that modulates (e.g., upregulates or downregulates) hVR-1, hVR-2, and rVR-2 
expression or activity. In another embodiment, the method involves administering an 

15 hVR-1, hVR-2, and rVR-2 protein or nucleic acid molecule as therapy to compensate for 
reduced or aberrant hVR-1, hVR-2, and rVR-2 expression or activity. 

Stimulation of hVR-1, hVR-2, and rVR-2 activity is desirable in situations in 
which hVR-1, hVR-2, and rVR-2 is abnormally downregulated and/or in which 
increased hVR-1, hVR-2, and rVR-2 activity is likely to have a beneficial effect. For 

20 example, stimulation of hVR-1, hVR-2, and rVR-2 activity is desirable in situations in 
which an hVR-1, hVR-2, and rVR-2 is downregulated and/or in which increased hVR-1, 
hVR-2, and rVR-2 activity is likely to have a beneficial effect. Likewise, inhibition of 
hVR-1, hVR-2, and rVR-2 activity is desirable in situations in which hVR-1, hVR-2, 
and rVR-2 is abnormally upregulated and/or in which decreased hVR-1 , hVR-2, and 

25 rVR-2 activity is likely to have a beneficial effect. 



3. Pharmacogenomics 

The hVR-1, hVR-2, and rVR-2 molecules of the present invention, as well as 
agents, or modulators which have a stimulatory or inhibitory effect on hVR-1, hVR-2, 
30 and rVR-2 activity (e.g., hVR-1 , hVR-2, and rVR-2 gene expression) as identified by a 
screening assay described herein can be administered to individuals to treat 
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(prophylactically or therapeutically) hVR-1, hVR-2, and rVR-2-associated disorders 
(e.g., pain disorders) associated with aberrant hVR-1, hVR-2, and rVR-2 activity. In 
conjunction with such treatment, pharmacogenomics (i.e., the study of the relationship 
between an individual's genotype and that individual's response to a foreign compound 
5 or drug) may be considered. Differences in metabolism of therapeutics can lead to 
severe toxicity or therapeutic failure by altering the relation between dose and blood 
concentration of the pharmacologically active drug. Thus, a physician or clinician may 
consider applying knowledge obtained in relevant pharmacogenomics studies in 
determining whether to administer an hVR-1, hVR-2, and rVR-2 molecule or hVR-1, 
1 0 h VR-2, and rVR-2 modulator as well as tailoring the dosage and/or therapeutic regimen 
of treatment with an hVR-1, hVR-2, and rVR-2 molecule or hVR-1, hVR-2, and rVR-2 
modulator. 

Pharmacogenomics deals with clinically significant hereditary variations in the 
response to drugs due to altered drug disposition and abnormal action in affected 

15 persons. See, for example, Eichelbaum, M. et al. (1996) Clin. Exp. Pharmacol. Physiol. 
23(10-1 1) :983-985 and Linder, M.W. et al. (1997) Clin. Chem. 43(2):254-266. In 
general, two types of pharmacogenetic conditions can be differentiated. Genetic 
conditions transmitted as a single factor altering the way drugs act on the body (altered 
drug action) or genetic conditions transmitted as single factors altering the way the body 

20 acts on drugs (altered drug metabolism). These pharmacogenetic conditions can occur 
either as rare genetic defects or as naturally-occurring polymorphisms. For example, 
glucose-6-phosphate dehydrogenase deficiency (G6PD) is a common inherited 
enzymopathy in which the main clinical complication is haemolysis after ingestion of 
oxidant drugs (anti-malarials, sulfonamides, analgesics, nitrofurans) and consumption of 

25 fava beans. 

One pharmacogenomics approach to identifying genes that predict drug 
response, known as "a genome-wide association", relies primarily on a high-resolution 
map of the human genome consisting of already known gene-related markers (e.g., a "bi- 
allelic" gene marker map which consists of 60,000-100,000 polymorphic or variable 
30 sites on the human genome, each of which has two variants.) Such a high-resolution 
genetic map can be compared to a map of the genome of each of a statistically 
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significant number of patients taking part in a Phase II/III drug trial to identify markers 
associated with a particular observed drug response or side effect. Alternatively, such a 
high resolution map can be generated from a combination of some ten-million known 
single nucleotide polymorphisms (SNPs) in the human genome. As used herein, a 
5 "SNP ,! is a common alteration that occurs in a single nucleotide base in a stretch of 
DNA. For example, a SNP may occur once per every 1000 bases of DNA. A SNP may 
be involved in a disease process, however, the vast majority may not be disease- 
associated. Given a genetic map based on the occurrence of such SNPs, individuals can 
be grouped into genetic categories depending on a particular pattern of SNPs in their 
10 individual genome. In such a manner, treatment regimens can be tailored to groups of 
genetically similar individuals, taking into account traits that may be common among 
such genetically similar individuals. 

Alternatively, a method termed the "candidate gene approach", can be utilized to 
identify genes that predict drug response. According to this method, if a gene that 

1 5 encodes a drugs target is known {e.g. , an hVR-1 , hVR-2, and rVR-2 protein of the 

present invention), all common variants of that gene can be fairly easily identified in the 
population and it can be determined if having one version of the gene versus another is 
associated with a particular drug response. 

As an illustrative embodiment, the activity of drug metabolizing enzymes is a 

20 major determinant of both the intensity and duration of drug action. The discovery of 
genetic polymorphisms of drug metabolizing enzymes {e.g., N-acetyl transferase 2 (NAT 
2) and cytochrome P450 enzymes CYP2D6 and CYP2C19) has provided an explanation 
as to why some patients do not obtain the expected drug effects or show exaggerated 
drug response and serious toxicity after taking the standard and safe dose of a drug. 

25 These polymorphisms are expressed in two phenotypes in the population, the extensive 
metabolizer (EM) and poor metabolizer (PM). The prevalence of PM is different among 
different populations. For example, the gene coding for CYP2D6 is highly polymorphic 
and several mutations have been identified in PM, which all lead to the absence of 
functional CYP2D6. Poor metabolizers of CYP2D6 and CYP2C19 quite frequently 

30 experience exaggerated drug response and side effects when they receive standard doses. 
If a metabolite is the active therapeutic moiety, PM show no therapeutic response, as 
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demonstrated for the analgesic effect of codeine mediated by its C YP2D6-formed 
metabolite morphine. The other extreme are the so called ultra-rapid metabolizers who 
do not respond to standard doses. Recently, the molecular basis of ultra-rapid 
metabolism has been identified to be due to CYP2D6 gene amplification. 
5 Alternatively, a method termed the "gene expression profiling", can be utilized to 

identify genes that predict drug response. For example, the gene expression of an 
animal dosed with a drug (e.g. , an hVR- 1 , hVR-2, and rVR-2 molecule or hVR- 1 , h VR- 
2, and rVR-2 modulator of the present invention) can give an indication whether gene 
pathways related to toxicity have been turned on. 

10 Information generated from more than one of the above pharmacogenomics 

approaches can be used to determine appropriate dosage and treatment regimens for 
prophylactic or therapeutic treatment an individual. This knowledge, when applied to 
dosing or drug selection, can avoid adverse reactions or therapeutic failure and thus 
enhance therapeutic or prophylactic efficiency when treating a subject with an hVR-1, 

15 hVR-2, and rVR-2 molecule or hVR-1, hVR-2, and rVR-2 modulator, such as a 
modulator identified by one of the exemplary screening assays described herein. 

This invention is further illustrated by the following examples which should not 
be construed as limiting. The contents of all references, patents and published patent 
20 applications cited throughout this application, as well as the Figures and the Sequence 
Listing are incorporated herein by reference. 



EXAMPLES 



25 EXAMPLE 1: IDENTIFICATION AND CHARACTERIZATION 

OF hVR-1, hVR-2, and rVR-2 cDNA 

In this example, the identification and characterization of the genes encoding 
hVR-1 (clone Fchrb87a6), hVR-2 (clone flh21el 1), hVR-2 alternate form (clone 
frhobl2c4), and rVR-2 (clone flrxbl47gl 1) are described. 

30 
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Isolation of the hVR-L hVR-2, and the rVR-2 cDNA 

The invention is based, at least in part, on the discovery of two human genes and 
one rat gene encoding novel members of the Capsaicin/V anilloid receptor family, 
referred to herein as hVR-1, hVR-2, and rVR-2, respectively. These clones were 
5 identified from a human heart library and a rat dorsal root ganglion (DRG) library, based 
on sequence homology to the known rat VR-1 (Accession Number AF029310). The 
sequence of the two human clones and the rat clone was determined and found to 
contain open reading frames. 

The nucleotide sequence of the full length hVR-1 cDNA and the predicted amino 
10 acid sequence of the hVR-1 polypeptide are shown in Figure 1 and in SEQ ID NOs:l 
and 2, respectively. 

The nucleotide sequence of the full length hVR-2 cDNA and the predicted amino 
acid sequence of the hVR-2 polypeptide are shown in Figure 2 and in SEQ ID NOs:4 
and 5, respectively. 

1 5 The nucleotide sequence of the partial hVR-2 (alternate form) cDNA and the 

predicted amino acid sequence of the hVR-2 (alternate form) polypeptide are shown in 

Figure 3 and in SEQ ID NOs:7 and 8, respectively. 

The amino acid sequence of the predicted full length human VR-2 protein 

(alternate form) is shown in Figure 16 and in SEQ ID NO:20. 
20 The nucleotide sequence of the partial rVR-2 cDNA and the predicted amino 

acid sequence of the rVR-2 polypeptide are shown in Figure 4 and in SEQ ID NOs: 10 

and 11, respectively. 

Analysis of the hVR-1, hVR-2, and rVR-2 Molecules 

25 The hVR-1 protein (SEQ ID NO:2) was aligned with the human VR-2 protein 

(SEQ ID NO:5) using the GAP program in the GCG software package (Blosum 62 
matrix) and a gap weight of 12 and a length weight of 4. The results showed a 46.348% 
identity and 55.378% similarity between the two sequences (see Figure 5). 

The hVR-1 nucleotide sequence (SEQ ID NO:l) was aligned with the human 

30 VR-2 nucleotide sequence (SEQ ID NO:4) using the GAP program in the GCG software 
package (nwsgapdna matrix) and a gap weight of 50 and a length weight of 3. The 
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results showed a 55.316% identity and 55.316% similarity between the two sequences 
(see Figure 6). 

The hVR-2 protein (SEQ ID NO:5) was aligned with the rat VR-2 protein (SEQ 
ID NO:l 1) using the CLUSTAL W (1.74) multiple sequence alignment program (Figure 
5 7), as well as using the GAP program in the GCG software package (Blosum 62 matrix) 
and a gap weight of 12 and a length weight of 4. The results showed a 79.167% identity 
and 81.703% similarity between the two sequences (see Figure 8). 

The hVR-1 nucleotide sequence (SEQ ID NO:l) was aligned with the rat VR-1 
nucleotide sequence (Accession Number: AF0293 1 0) using the GAP program in the 
1 0 GCG software package (nwsgapdna matrix) and a gap weight of 50 and a length weight 
of 3. The results showed a 82.125% identity and 82.125% similarity between the two 
sequences (see Figure 9). 

The hVR-1 protein (SEQ ID NO:2) was aligned with the rat VR-1 protein 
(Accession Number: AF0293 10) using the GAP program in the GCG software package 
1 5 (Blosum 62 matrix) and a gap weight of 1 2 and a length weight of 4. The results showed 
a 86.022% identity and 89.247% similarity between the two sequences (see Figure 10). 

The hVR-2 protein (SEQ ID NO:5) was aligned with the human VR-2 protein 
(alternate form) (SEQ ID NO:8) using the CLUSTAL W (1.74) multiple sequence 
alignment program (Figure 11). 
20 Finally, the hVR-2 protein (SEQ ID NO:5) was aligned with the predicted full 

length human VR-2 protein (alternate form) (SEQ ID NO:20) using the CLUSTAL W 
(1.74) multiple sequence alignment program (Figure 17). 

A search was performed against the HMM database resulting in the identification 
of three ankyrin repeat domains in the amino acid sequence of hVR-1 (SEQ ID NO:2) at 
25 about residues 201-233, 248-283, and 333-36 1 , and in the amino acid sequence of hVR- 
2 (SEQ ID NO:5) at about residues 162-194, 208-243, and 293-328. The results of the 
searches are set forth in Figures 13 and 15, respectively. 

Hydropathy plots have identified 6 transmembrane domains in the hVR-1 and 
the hVR-2 proteins (see Figures 12 and 14, respectively). 
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A series of searches have revealed that the hVR-1 protein matches the ProDom 
entry 141801 for the vanilloid receptor subtype and the ProDom entry 145518 for the 
vanilloid receptor subtype. 

Moreover, a search was performed against the Prosite database resulting in the 
5 identification of four N-glycosylation sites in the amino acid sequence of SEQ ID NO:5 
(at about residues 171-174, 192-195, 604-607, and 749-752), three cGMP-dependent 
protein kinase phosphorylation sites in the amino acid sequence of SEQ ID NO:5 (at 
about residues 2-5, 368-371, and 499-502), a series of protein kinase C and Casein 
kinase II phosphorylation sites in the amino acid sequence of SEQ ID NO: 5, two 
10 tyrosine kinase phosphorylation sites in the amino acid sequence of SEQ ID NO:5 (at 
about residues 368-375 and 622-628), and two myristoylation sites in the amino acid 
sequence of SEQ ID NO:5 (at about residues 1 69-1 74 and 765-770). 

Tissue Distribution of hVR-I and hVR-2 mRNA 
15 This Example describes the tissue distribution of hVR-1 and hVR-2 mRNA as 

determined by in situ hybridization. 

For in situ analysis, tissues, such as brain regions and whole brain, obtained from 
human and monkey were first frozen on dry ice. Ten-micrometer-thick coronal sections 
of the tissues were postfixed with 4% formaldehyde in DEPC treated IX phosphate- 

20 buffered saline at room temperature for 10 minutes before being rinsed twice in DEPC 
IX phosphate-buffered saline and once in 0.1 M triethanolamine-HCl (pH 8.0). 
Following incubation in 0.25% acetic anhydride-0.1 M triethanolamine-HCI for 10 
minutes, sections were rinsed in DEPC 2X SSC (IX SSC is 0.15M NaCl plus 0.015M 
sodium citrate). Tissue was then dehydrated through a series of ethanol washes, 

25 incubated in 100% chloroform for 5 minutes, and then rinsed in 100% ethanol for 1 
minute and 95% ethanol for 1 minute and allowed to air dry. 

Hybridizations were performed with 35S-radioIabeIed (5 X 10? cpm/ml) cRNA 
probes. Probes were incubated in the presence of a solution containing 600 mM NaCl, 
10 mM Tris (pH 7.5), 1 mM EDTA, 0.01% sheared salmon sperm DNA, 0.01% yeast 

30 tRNA, 0.05% yeast total RNA type XI, 1 X Denhardfs solution, 50% formamide, 10% 
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dextran sulfate, 100 mM dithiothreitol. 0.1% sodium dodecyl sulfate (SDS), and 0.1% 
sodium thiosulfate for 1 8 hours at 55°C. 

After hybridization, slides were washed with 2 X SSC. Sections were then 
sequentially incubated at 37°C in TNE (a solution containing 10 mM Tris-HCl (pH 7.6), 
5 500 mM NaCl, and 1 mM EDTA), for 10 minutes, in TNE with 10|ig of RNase A per ml 
for 30 minutes, and finally in TNE for 10 minutes. Slides were then rinsed with 2 X 
SSC at room temperature, washed with 2 X SSC at 50°C for 1 hour, washed with 0.2 X 
SSC at 55°C for 1 hour, and 0.2 X SSC at 60°C for 1 hour. Sections were then 
dehydrated rapidly through serial ethanol-0.3 M sodium acetate concentrations before 
10 being air dried and exposed to Kodak Biomax MR scientific imaging film for 24 hours 
and subsequently dipped in NB-2 photoemulsion and exposed at 4°C for 7 days before 
being developed and counter stained. 

The data indicate that the hVR-1 molecule is not expressed in human nor 
monkey brain. The hVR-1 molecule is expressed in nodose, trigeminal sensory neurons, 
15 but is not expressed in sympathetic neurons. Within the nodose sensory neurons and 
trigeminal sensory neurons, expression was seen in distinct sub-populations. Moreover, 
hVRl is expressed in some, but not all, small dorsal root ganglion (DRG) neurons and in 
a few medium sized DRG neurons. The hVR-1 molecule is partially co-expressed with 
the neuropeptide CGRP and with substance P which are present in nociceptive neurons. 
20 The data further indicate that the VR-2 molecule is expressed in both human and 

monkey brain, primarily in cortical neurons. The VR2 molecule is also expressed in 
other brain regions, for example, the thalamus, striatum, hippocampus, hypothalamus, 
midbrain, medula and brain stem. In addition, the VR-2 molecule is expressed in 
parasympathetic neurons of the monkey heart (atrium), nodose sensory neurons, 
25 trigeminal (TRG) sensory neurons, dorsal root ganglion sensory neurons, sympathetic 
neurons, and motor neurons of the spinal cord. The VR2 molecule is widely expressed 
in TRG and DRG neurons, being present in most small and medium sized neurons and 
also in a few of the large neurons. VR2, like VR-1 , partially co-localizes with CGRP 
and substance P. 
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Trigeminal sensory neurons are recognized pain centers while sympathetic 
neurons have been shown to be involved in neuropathic pain. 



EXAMPLE 2: EXPRESSION OF RECOMBINANT hVR-1, hVR-2, AND 

5 rVR-2 PROTEIN IN BACTERIAL CELLS 

In this example, hVR-1, hVR-2, and rVR-2 is expressed as a recombinant 
glutathione-S-transferase (GST) fusion polypeptide in E. coli and the fusion polypeptide 
is isolated and characterized. Specifically, hVR-1, hVR-2, and rVR-2 is fused to GST 
and this fusion polypeptide is expressed in E. coli, e.g., strain PEB 199. Expression of 
1 0 the GST-hVR- 1 , GST-hVR-2, and GST-rVR-2 fusion protein in PEB 1 99 is induced 

with IPTG. The recombinant fusion polypeptide is purified from crude bacterial lysates 
of the induced PEB 199 strain by affinity chromatography on glutathione beads. Using 
polyacrylamide gel electrophoretic analysis of the polypeptide purified from the 
bacterial lysates, the molecular weight of the resultant fusion polypeptide is determined. 

15 

EXAMPLE 3: EXPRESSION OF RECOMBINANT hVR-1, hVR-2, 

AND rVR-2 PROTEIN IN COS CELLS 

To express the hVR-1, hVR-2, and rVR-2 gene in COS cells, the pcDN A/Amp 
vector from Invitrogen Corporation (San Diego, CA) is used. This vector contains an 

20 SV40 origin of replication, an ampicillin resistance gene, an E. coli replication origin, a 
CMV promoter followed by a poly linker region, and an SV40 intron and 
polyadenylation site. A DNA fragment encoding the entire hVR-1, hVR-2, and rVR-2 
protein and an HA tag (Wilson et al (1984) Cell 31:161) or a FLAG tag fused in-frame 
to its 3* end of the fragment is cloned into the polylinker region of the vector, thereby 

25 placing the expression of the recombinant protein under the control of the CMV 
promoter. 

To construct the plasmid, the hVR-1, hVR-2, and rVR-2 DNA sequence is 
amplified by PCR using two primers. The 5 1 primer contains the restriction site of 
interest followed by approximately twenty nucleotides of the hVR-1, hVR-2, and rVR-2 
30 coding sequence starting from the initiation codon; the 3' end sequence contains 
complementary sequences to the other restriction site of interest, a translation stop 
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codon, the HA tag or FLAG tag and the last 20 nucleotides of the hVR-1, hVR-2, and 
rVR-2 coding sequence. The PCR amplified fragment and the pCDNA/Amp vector are 
digested with the appropriate restriction enzymes and the vector is dephosphorylated 
using the CIAP enzyme (New England Biolabs, Beverly, MA). Preferably the two 
5 restriction sites chosen are different so that the hVR-1, hVR-2, and rVR-2 gene is 

inserted in the correct orientation. The ligation mixture is transformed into E. coli cells 
(strains HB101, DH5a, SURE, available from Stratagene Cloning Systems, La Jolla, 
CA, can be used), the transformed culture is plated on ampicillin media plates, and 
resistant colonies are selected. Plasmid DNA is isolated from transformants and 
10 examined by restriction analysis for the presence of the correct fragment. 

COS cells are subsequently transfected with the hVR-1, hVR-2, and rVR-2- 
pcDNA/Amp plasmid DNA using the calcium phosphate or calcium chloride co- 
precipitation methods, DEAE-dextran-mediated transfection, lipofection, or 
electroporation. Other suitable methods for transfecting host cells can be found in 
15 Sambrook, J., Fritsh, E. F., and Maniatis, T. Molecular Cloning: A Laboratory Manual. 
2nd, ed, Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, NY, 1989. The expression of the hVR-1, hVR-2, and rVR-2 polypeptide 
is detected by radiolabelling ( 35 S-methionine or 35 S-cysteine available from NEN, 
Boston, MA, can be used) and immunoprecipitation (Harlow, E. and Lane, D. 
20 Antibodies: A Laboratory ManuaL Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, NY, 1988) using an HA specific monoclonal antibody. Briefly, the cells are 
labelled for 8 hours with 35 S-methionine (or 35 S-cysteine). The culture media are then 
collected and the cells are lysed using detergents (RIPA buffer, 150 mM NaCl, 1% NP- 
40, 0.1% SDS, 0.5% DOC, 50 mM Tris, pH 7.5). Both the cell lysate and the culture 
25 media are precipitated with an HA specific monoclonal antibody. Precipitated 
polypeptides are then analyzed by SDS-PAGE. 

Alternatively, DNA containing the hVR-1, hVR-2, and rVR-2 coding sequence 
is cloned directly into the polylinker of the pCDNA/Amp vector using the appropriate 
restriction sites. The resulting plasmid is transfected into COS cells in the manner 
30 described above, and the expression of the hVR-1, hVR-2, and rVR-2 polypeptide is 
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detected by radiolabelling and immunoprecipitation using an hVR-L hVR-2 5 and rVR-2 
specific monoclonal antibody. 

EXAMPLE 4: ELECTROPHYSIOLOGICAL STUDIES OF VR2 

5 Human VR2 was functionally characterized in both HEK293 cells and Xenopus 

oocytes using electrophysiological methods. VR2 (in the pcDNA3. 1 vector purchased 
by Invitrogen) was transiently expressed in HEK293 cells (ATCC) and recordings were 
performed 48 hours after transfection of cells using the whole-cell patch-clamp method 
(described in Bertil Hille, Ionin Channels of excitable membranes, 1992; Hammill et al 

10 (1981) Pluger Arch. 391:85-100). The results indicate that heat stimulation (>50 °C) 
induces a rapid inactivating inward current (1-2 nA). Heat-evoked currents of VR2 
displayed profound desensitization and could be reversibly blocked by the VR1 inhibitor 
capsazepin (at a 10 ^iM concentration). In contrast to rat VR1, Capsaicin (at a 1-10 \iM 
concentration), resiniferatoxin (at a 0.1-3 jiM concentration), and low pH (5.0-6.0) do 

1 5 not induce any currents from VR2. Binding studies of [ 3 H]-resiniferatoxin (NEN) to 
both human VR1 and VR2 in membranes isolated from HEK293 cell homogenates also 
indicate that resiniferatoxin (at a 0.1-10 nM concentration) has no specific binding to 
VR2 while it binds to human VR1 with high affinities. 

For the oocyte studies, human VR2 was subcloned into an oocyte expression 

20 vector containing 5'- and 3'-UTR of Xenopus p-globin (Chiara et al. (1999) 

Biochemistry 38(20)6689-6698). In vitro trasncription was carried out as described in 
Chiara et al {supra) and cRNA (10-100 ng) was then injected into the oocytes. VR2 
function was characterized in the oocytes 48 hours after cRNA injection using a standard 
two-electrode voltage-clamp. Consistent with the data from the HEK293 studies, VR2 

25 can only be activated by heat stimulation (48-50 °C) but not by vanilloid receptor 

agonists, capsaicin, or resiniferatoxin. The vanilloid receptor antagonist capsazepine (at 
a 1-10 jaM concentration) blocks the heat response of VR2 reversibly. 
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EXAMPLE 5: GENERATION OF ANTI-hVR-2 ANTIBODIES AND hVR-2 

PROTEIN LOCALIZATION BY IMMUNOSTAINING 

Polyclonal antisera were raised in rabbits against the following three peptides 
derived from the human VR2 amino acid sequence, using the techniques described in Ed 
5 Harlow and David Lane (1988) "Antibodies; A Laboratory Manual" Cold Spring harbor 
Laboratory Press. 

Antibody PEPTIDE 1: AFHCKSPHRHRMVVLE (SEQ ID NO: 13) 
Antibody PEPTIDE 2: RPEAPTGPNATESVQPMEGQEDEGN (SEQ ID NO: 14) 
Antibody PEPTIDE 3: SVLEMENGYWWCRKKQRAG (SEQ ID NO: 15) 
10 Antisera were subsequently affinity purified using the peptide immunogen. 

The polyclonal antisera were tested for immunostaining of both monkey and rat 
dorsal root ganglion sensory neurons. Peptides 1 and 3 gave specific staining of 
subpopulations of sensory neurons that was competed with the corresponding peptide. 
This pattern of expression was very similar to the one observed using a VR-2 riboprobe. 

15 

EXAMPLE 6: CHROMOSOMAL LOCALIZATION OF hVR-1 AND 

hVR-2 

To chromosomally map the hVR-1 gene, primers were designed based on the 
sequence of hVR-1 (clone Fchrb87a6) (amplifying a 1 77 bp product from a human 

20 control cell line DNA and multiple faint larger products from a control Hamster cell line 
DNA by PCR). These primers were used to amplify 93 DNAs in duplicate from the 
Genebridge 4 Radiation Hybrid Panel (Research Genetics, Inc., Huntsville, AL). 

The hVR-1 primers used in the PCR mapping studies were: forward - 
TAGGAGACCCCGTTGCCACG (SEQ ID NO: 16) and reverse - 

25 GATTCACTTGGGGACAGTGACG (SEQ ID NO: 1 7) and the PCR reactions were 
performed as follows: 5 ul Template DNA (lOng/ul), 1.5ul 10X Perkin Elmer PCR 
Buffer, 1.2ul Pharmacia dNTP mix 2.5 mM, 1.1 5ul Forward primer 6.6uM, 1.1 5ul 
Reverse primer 6.6uM, 5ul Gibco/BRL Platinum Taq .05U/ul (Hot Start), using an 
amplification profile of: 95°C for 10 minutes followed by 35 Cycles of 94°C for 40 

30 seconds, 55°C for 40 seconds, 72°C for 40 seconds, and 72°C for 5 minutes. The PCR 
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products were run on 2% agarose gels, post-stainedwith SYBR Gold (1 : 10,000 dilution 
in IX TBE), and scanned on a Molecular Dynamics 595 Fluorimager. 

The following is the vector data for the 93 Genebridge4 hybrid DNAs. These are 
in order 1-93. A "1" is a positive result, a "-" is a negative result, a "?" is an ambiguous 
5 result. 

hVRl 1 - - 1 ? - 1 - 1 - 1 1 - - - 1 - - 1 - 1 1 - - 1 - 1 - 1 - - 1 1-1 1-1 1 1 - - - 

1 ... 1 i J 1-1 — 1 1 1 - 1 - 1 --- 1 1-1 1 

1 0 RH linkage analysis was performed using the Map Manager QTb28 software package. 

hVRl was found to map to the p arm of human chromosome 17, 18.9 cR 3000 
telomeric to the Whitehead Institute framework marker WI-6584, and 7.7 cR 3000 
centromeric of the Whitehead framework marker WI-5436. LOD scores for linkage 
were 14.5 for WI-6584 and 19.3 for WI-5436. This region corresponds to the 

1 5 cytogenetic location 1 7p 1 2- 1 3 . This region is syntenic to mouse chromosome 1 1 . 

To chromosomally map the hVR-2 gene, primers were designed from 5' UTR 
sequence of human VR2 (clone Flh21el 1) (amplifying a 166 bp product from a human 
control cell line DNA and 2 much larger faint bands from a control Hamster cell line 
DNA by PCR). These primers were used to amplify 93 DNAs in duplicate from the 

20 Genebridge 4 Radiation Hybrid Panel (Research Genetics, Inc., Huntsville, AL)/ 
The hVR-2 primers used in the PCR mapping studies were: forward - 
TTAAGCTCCCGTTCTCACCG (SEQ ID NO: 18) and reverse - 
GCTGCGGGAGGAAGTGAAGC (SEQ ID NO: 19) and the PCR reactions were 
performed as follows: 5|al Template DNA (10ng/|il), 1 .5|al 10X Perkin Elmer PCR 

25 Buffer, 1.2(il Pharmacia dNTP mix 2.5mM, 1.15|al Forward primer 6.6jiM, 1.15^1 
Reverse primer 6.6jiM, 5^1 Gibco/BRL Platinum Taq .05U/|iI (Hot Start), using an 
amplification profile of 95°C for 10 minutes, followed by 35 Cycles of 94°C for 40 
seconds, 55°C for 40 seconds, 72°C for 40 seconds, and 72°C for 5 minutes. The PCR 
products were run on 2% agarose gels, post-stainedwith SYBR Gold (1 : 10,000 dilution 

30 in IX TBE), and scanned on a Molecular Dynamics 595 Fluorimager. 



BNSDOCID: <WO 0029577A1JA> 



WO 00/29577 PCT/US99/26701 

-98- 

The following is the vector data for the 93 Genebridge4 hybrid DNAs. These are 
in order 1-93. A "1" is a positive result, a "-" is a negative result, a "?" is an ambiguous 
result. 

5 hVR2 1 - - 1 1 - ? 1 1 -1--1-1 1 1 - 1 1 1 1 - - I - 1 1 - 1 1 1 . . 1 

1 ... 1 1 1 1 1 1-1 1-1-1 1 1 1 1 1 ? 

RH linkage analysis was performed using the Map Manager QTb28 software package. 

10 hVR2 was found to map to the p arm of human chromosome 1 7 , 29.3cR cR 3000 

telomeric to the Whitehead Institute framework marker D17S721, and 23.3 cR 3000 
centromeric of the Whitehead framework marker AFMA043ZB5. LOD scores for 
linkage were 1 1.9 for D17S721 and 13.6 for AFMA043ZB5. This region corresponds to 
the cytogenetic location 17pl 1-12. This region is syntenic to mouse chromosome 11. 

15 

Equivalents 

Those skilled in the art will recognize, or be able to ascertain using no more than 
routine experimentation, many equivalents to the specific embodiments of the invention 
described herein. Such equivalents are intended to be encompassed by the following 
20 claims. 
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1 . An isolated nucleic acid molecule selected from the group consisting of: 

(a) a nucleic acid molecule comprising the nucleotide sequence set 
5 forth in SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12 or a complement thereof; and 

(b) a nucleic acid molecule consisting of the nucleotide sequence set 
forth in SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12 or a complement thereof. 

2. An isolated nucleic acid molecule which encodes a polypeptide selected 
10 from the group consisting of: 

(a) a polypeptide comprising the amino acid sequence set forth in 
SEQ ID NO:2, 5, 8, or 1 1 ; and 

(b) a polypeptide consisting of the amino acid sequence set forth in 
SEQ ID NO:2, 5, 8, or 11. 

15 

3. An isolated nucleic acid molecule which encodes a naturally occurring 
allelic variant of a polypeptide comprising the amino acid sequence set forth in SEQ ID 
NO:2, 5, 8, or 11. 



20 4. An isolated nucleic acid molecule selected from the group consisting of: 

a) a nucleic acid molecule comprising a nucleotide sequence which 
is at least 83% identical to the nucleotide sequence of SEQ ID NO: 1, 3, 4, 6, 7, 9, 10, or 
12, or a complement thereof; 

b) a nucleic acid molecule comprising a fragment of at least 20 

25 nucleotides of a nucleic acid comprising the nucleotide sequence of SEQ ID NO:l, 3, 4, 
6, 7, 9, 10, or 12, or a complement thereof; 

c) a nucleic acid molecule which encodes a polypeptide comprising 
an amino acid sequence at least about 87% identical to the amino acid sequence of SEQ 
ID NO:2, 5, 8, or 11; and 
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d) a nucleic acid molecule which encodes a fragment of a 
polypeptide comprising the amino acid sequence of SEQ ID NO:2 ? 5, 8, or 1 1, wherein 
the fragment comprises at least 1 5 contiguous amino acid residues of the amino acid 
sequence of SEQ ID NO:2, 5, 8, or 1 1. 

5 

5. An isolated nucleic acid molecule comprising the nucleic acid molecule 
of any one of claims 1, 2, 3, or 4, and a nucleotide sequence encoding a heterologous 
polypeptide. 

10 6. A vector comprising the nucleic acid molecule of any one of claims I, 2, 

3, or 4. 

7. The vector of claim 6, which is an expression vector. 

15 8. A host cell transfected with the expression vector of claim 7. 

9. A method of expressing a polypeptide comprising culturing the host cell 
of claim 8 in an appropriate culture medium to, thereby, express the polypeptide. 
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10. An isolated polypeptide selected from the group consisting of: 

a) a fragment of a polypeptide comprising the amino acid sequence 
of SEQ ID NO:2, 5, 8, or 1 1, wherein the fragment comprises at least 15 contiguous 
amino acids of SEQ ID NO:2, 5, 8, or 1 1 ; 
5 b) a naturally occurring allelic variant of a polypeptide comprising 

the amino acid sequence of SEQ ID NO:2, 5, 8, or 1 1, wherein the polypeptide is 
encoded by a nucleic acid molecule which hybridizes to a nucleic acid molecule 
consisting of SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12 under stringent conditions; 

c) a polypeptide which is encoded by a nucleic acid molecule 
10 comprising a nucleotide sequence which is at least 83% identical to a nucleic acid 

comprising the nucleotide sequence of SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12; and 

d) a polypeptide comprising an amino acid sequence which is at least 
87% identical to the amino acid sequence of SEQ ID NO:2, 5, 8, or 1 1 . 

15 11. The isolated polypeptide of claim 10 comprising the amino acid sequence 

ofSEQIDNO:2, 5, 8, or 11. 

12. The polypeptide of claim 10, further comprising heterologous amino acid 
sequences. 

20 

13. An antibody which selectively binds to a polypeptide of claim 10. 

14. A method for detecting the presence of a polypeptide of claim 10 in a 
sample comprising: 

25 a) contacting the sample with a compound which selectively binds to 

the polypeptide; and 

b) determining whether the compound binds to the polypeptide in 
the sample to thereby detect the presence of a polypeptide of claim 1 0 in the sample. 

30 15. The method of claim 14, wherein the compound which binds to the 

polypeptide is an antibody. 
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16. A kit comprising a compound which selectively binds to a polypeptide of 
claim 1 0 and instructions for use. 

5 1 7. A method for detecting the presence of a nucleic acid molecule of any 

one of claims 1 , 2, 3, or 4 in a sample comprising: 

a) contacting the sample with a nucleic acid probe or primer which 
selectively hybridizes to the nucleic acid molecule; and 

b) determining whether the nucleic acid probe or primer binds to a 
1 0 nucleic acid molecule in the sample to thereby detect the presence of a nucleic acid 

molecule of any one of claims 1, 2, 3, or 4 in the sample. 

18. The method of claim 17, wherein the sample comprises mRNA 
molecules and is contacted with a nucleic acid probe. 

15 

19. A kit comprising a compound which selectively hybridizes to a nucleic 
acid molecule of any one of claims 1, 2, 3, or 4 and instructions for use. 

20. A method for identifying a compound which binds to a polypeptide of 
20 claim 10 comprising: 

a) contacting the polypeptide, or a cell expressing the polypeptide 
with a test compound; and 

b) determining whether the polypeptide binds to the test compound. 

25 21. The method of claim 20, wherein the binding of the test compound to the 

polypeptide is detected by a method selected from the group consisting of: 

a) detection of binding by direct detection of test 
compound/polypeptide binding; 

b) detection of binding using a competition binding assay; and 

30 c) detection of binding using an assay for hVR- 1 , hVR-2, or rVR-2 

activity. 
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22. A method for modulating the activity of a polypeptide of claim 10 
comprising contacting the polypeptide or a cell expressing the polypeptide with a 
compound which binds to the polypeptide in a sufficient concentration to modulate the 
activity of the polypeptide. 

5 

23. A method for identifying a compound which modulates the activity of a 
polypeptide of claim 10 comprising: 

a) contacting a polypeptide of claim 1 0 with a test compound; and 

b) determining the effect of the test compound on the activity of the 
10 polypeptide to thereby identify a compound which modulates the activity of the 

polypeptide. 

24. A method for treating a subject having a disorder characterized by 
aberrant hVR-1 or hVR-2 protein activity or nucleic acid expression comprising 

15 administering to the subject a hVR-1 or hVR-2 modulator such that treatment of the 
subject occurs. 

25. The method of claim 24, wherein the hVR-1 or hVR-2 modulator is a 
small molecule. 

20 

26. The method of claim 24, wherein the disorder is a pain disorder. 
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humanVRl gene with translation of open reading frame 

Input file Fchrb87a6.seg; Output File Fchrb87a6.tra 
Sequence length 3909 

GTGAGCGCAACGCACTGCGGGCAGTGAGCGCAACGCACTGC6G6CAGTGAGCGCAAC6CACTGCG6GCAGT6AGCGCAA 

CGCACTGCGGGCAGTGAGCGCAACGCACTGCGGGCAGTGAGCGCAACGCACTGCGGGCAGTGAGCGCAACGCACTGCGG 

GCAGTGAGCGCAACGCACTTGCGGGCAGTGAGCGCAACGCACTGCGGGCAGTGAGCGCAACGCACTGCGGGCAGTGAGC 

GCAACGCACTGCGGGCAGTGAGCGCAACGCACTGCGGGCAGTGAGCGCAACGCACTGCGGGCAGTGAGCGCAACGCACT 

GCGGGCAGTGAGCGCAACGCACTGCGGGCAGTGAGCGCAACGCACTGCGGGCAGTGAGCGCAACGCACTGCGGGCAGTG 

AGCGCAACGCACTGCGGGCAGTGAGCGCAACGCACTTAATGTGAGTTAGCTCACTCATTAGGCACCCCAGGCTTTACAC 

TTTATGCTTCCGGCTCGTATGTTGTGTGGAATTGTGAGCGGATAACAATTTCACACAGGAAACAGCTATGACCATGATT 

ACGCCAAGCTCTAATACGACTCACTATAGGGAAAGCTGGTACGCCTGCAGGTACCGGTCCGGAATTCCCGGGTCGACCC 

ACGCGTCCGAAAACACACCTCTCTGCTGTGGGAAGACTGTGCAATGGCACAGCCGCAGAGCTTGGTTTGGGAGGTTGAA 

GTGCTCTGGGGAGAATTCGTAGATCATCCTCAGAAAAGCCTTGCCCTGGTGTTCTACCAGAAAAACGTCTCCCAATCAC 

CCAGAAAAGCTGTCCACAGTAGTCCCCCCTTATCCACGGGTGTCACTTTCCATGGGTTCAGTTATTTGCGGTCAACCAC 

GGTCTGCCAATATTAAATGGAAAATTCTTCAAACAGTTCCCAAGTTTTCCCTTGTGCATTGTTCTGAGCAGTGTGATGA 

AGAGTCTCTGCCGTGCCATCTGGGATGCAAACCGTCCCTGTGTCCCCCACGTCCAGGCCGTAGATGCTCCCCGCCGGTC 

AGTCACTTAGTCGTCAGATCGCCCGTCCTGGTATCACAGTGCTTCTGTTCAGGTTGCACACTGGGCCACAGAGGATCCA 

MKKWSSTDLGTAADPLQK 18 
GCAAGG ATG AAG AAA TGG AGC AGC ACA GAC TTG GGG ACA GCT GCG GAC CCA CTC CAA AAG 54 

DTCPDPLDGDPNSR PPPAKP38 
GAC ACC TGC CCA GAC CCC CTG GAT GGA GAC CCT AAC TCC AGG CCA CCT CCA GCC AAG CCC 114 

QLPTAKSRTRLFGKGDSEEA58 
CAG CTC CCC ACG GCC AAG AGC CGC ACC CGG CTC TTT GGG AAG GGT GAC TCG GAG GAG GCT 174 

FPVDCPHEEGELDSCPTITV78 
TTC CCG GTG GAT TGC CCC CAC GAG GAA GGT GAG TTG GAC TCC TGC CCG ACC ATC ACA GTC 234 

SPVITIQRPGDGPTGARLLS98 
AGC CCT GTT ATC ACC ATC CAG AGG CCA GGA GAC GGC CCC ACC GGT GCC AGG CTG CTG TCC 294 
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QDSVAASTEKTLRLYDRRS I 118 
CAG GAC TCT GTC GCC GCC AGC ACC GAG AAG ACC CTC AGG CTC TAT GAT CGC AGG AGT ATC 354 

FEAVAQNNCQDLESLLLFLQ 138 
TTT GAA GCC GTT GCT CAG AAT AAC TGC CAG GAT CTG GAG AGC CTG CTG CTC TTC CTG CAG 414 



KSKKHLTDNE 
AAG AGC AAG AAG CAC CTC ACA GAC AAC GAG 


F K 
TTC AAA 


D 
GAC 


PET 
CCT GAG ACA 


G 
GGG 


K 
AAG 


T 

ACC 


C 
TGT 


158 
474 


LLKAMLNLHD 
CTG CTG AAA GCC ATG CTC AAC CTG CAC GAC 


G Q 
GGA CAG 


N 
AAC 


T T I 
ACC ACC ATC 


P 
CCC 


L 

CTG 


L 
CTC 


L 
CTG 


178 
534 


EIARQTDSLK 
GAG ATC GCG CGG CAA ACG GAC AGC CTG AAG 


E L 
GAG CTT 


V 

GTC 


N A S 
AAC GCC AGC 


Y 
TAC 


T 
ACG 


D 
GAC 


S 
AGC 


198 
594 


YYKGQTALHI 
TAC TAC AAG GGC CAG ACA GCA CTG CAC ATC 


A I 
GCC ATC 


E 
GAG 


R R N 
AGA CGC AAC 


M 
ATG 


A 

GCC 


L 
CTG 


V 

GTG 


218 
654 


TLLVENGADV 
ACC CTC CTG GTG GAG AAC GGA GCA GAC GTC 


Q A 

CAG GCT 


A 
GCG 


A H G 
GCC CAT GGG 


D 
GAC 


F 
TTC 


F 
TTT 


K 
AAG 


238 
714 


KTKGRPGFYF 
AAA ACC AAA GGG CGG CCT GGA TTC TAC TTC 


G E 
GGT GAA 


L 
CTG 


P L S 
CCC CTG TCC 


L 
CTG 


A 

GCC 


A 
GCG 


C 
TGC 


258 
774 


TNQLGIVKFL 
ACC AAC CAG CTG GGC ATC GTG AAG TTC CTG 


L Q 
CTG CAG 


N 
AAC 


S W Q 
TCC TGG CAG 


T 
ACG 


A 

GCC 


D 
GAC 


I 
ATC 


278 
834 


SARDSVGNTV 
AGC GCC AGG GAC TCG GTG GGC AAC ACG GTG 


L H 
CTG CAC 


A 

GCC 


L V E 
CTG GTG GAG 


V 
GTG 


A 

GCC 


D 
GAC 


N 
AAC 


298 
894 


TADNTKFVTS 
ACG GCC GAC AAC ACG AAG TTT GTG ACG AGC 


M Y 
ATG TAC 


N 
AAT 


E I L 
GAG ATT CTG 


M 
ATG 


L 
CTG 


G 
GGG 


A 

GCC 


318 
954 


KLHPTLKLEE 
AAA CTG CAC CCG ACG CTG AAG CTG GAG GAG 


L T 
CTC ACC 


N 
AAC 


K K G 
AAG AAG GGA 


M 
ATG 


T 
ACG 


P 
CCG 


L 338 
CTG 1014 


ALAAGTGKIG 
GCT CTG GCA GCT GGG ACC GGG AAG ATC GGG 


V L 
GTC TTG 


A 

GCC 


Y I L 
TAT ATT CTC 


Q 
CAG 


R 

CGG 


E 
GAG 


I 
ATC 


358 
1074 


QEPECRHLSR 
CAG GAG CCC GAG TGC AGG CAC CTG TCC AGG 


K F 
AAG TTC 


T 
ACC 


E W A 
GAG TGG GCC 


Y 
TAC 


G 
GGG 


P 
CCC 


V 

GTG 


378 
1134 


HSSLYDLSCI 
CAC TCC TCG CTG TAC GAC CTG TCC TGC ATC 


D T 
GAC ACC 


C 
TGC 


E K N 
GAG AAG AAC 


S 
TCG 


V 

GTG 


L 
CTG 


E 398 
GAG 1194 


VIAYSSSETP 
GTG ATC GCC TAC AGC AGC AGC GAG ACC CCT 


N R 
AAT CGC 


H 
CAC 


DHL 
GAC ATG CTC 


L 
TTG 


V 

GTG 


E 
GAG 


P 418 
CCG 1254 
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LNRLLQDKWDRFVKRIFYFN438 
CT6 AAC CGA CTC CTG CAG GAC AAG TGG GAC AGA TTC GTC AAG CGC ATC TTC TAC TTC AAC 1314 

FLVYCLYMI I FTMAAYYRPV 458 
TTC CTG GTC TAC TGC CTG TAC ATG ATC ATC TTC ACC ATG GCT GCC TAC TAC AGG CCC GTG 1374 

DGLPPFKMEKIGDYFRVTGE478 
GAT GGC TTG CCT CCC TTT AAG ATG GAA ARA ATT GGA GAC TAT TTC CGA GTT ACT GGA GAG 1434 

ILSVLGGVYFFFRGIQYFLQ498 
ATC CTG TCT GTG TTA GGA GGA GTC TAC TTC TTT TTC CGA GGG ATT CAG TAT TTC CTG CAG 1494 

RRPSMKTLFVDSYSEMLF FL 518 
AGG CGG CCG TCG ATG AAG ACC CTG TTT GTG GAC AGC TAC AGT GAG ATG CTT TTC TTT CTG 1554 

QSLFMLATVVLYFSHLKEYV 538 
CAG TCA CTG TTC ATG CTG GCC ACC GTG GTG CTG TAC TTC AGC CAC CTC AAG GAG TAT GTG 1614 

ASMVFSLALGWTNMLYYTRG558 
GCT TCC ATG GTA TTC TCC CTG GCC TTG GGC TGG ACC AAC ATG CTC TAC TAC ACC CGC GGT 1674 

FQQMGIYAVMIEKMILRDLC578 
TTC CAG CAG ATG GGC ATC TAT GCC GTC ATG ATA GAG AAG ATG ATC CTG AGA GAC CTG TGC 1734 

R F M F V Y IVFLFGFSTAVVTL 598 
CGT TTC ATG TTT GTC TAC ATC GTC TTC TTG TTC GGG TTT TCC ACA GCG GTG GTG ACG CTG 1794 

IEDGKNDSLPSESTSHRWRG618 
ATT GAA GAC GGG AAG AAT GAC TCC CTG CCG TCT GAG TCC ACG TCG CAC AGG TGG CGG GGG 1854 

PA CRPPDSSYNSLYSTCLEL638 
CCT GCC TGC AGG CCC CCC GAT AGC TCC TAC AAC AGC CTG TAC TCC ACC TGC CTG GAG CTG 1914 

FKFTIGMGDLEFTENYDFKA658 
TTC AAG TTC ACC ATC GGC ATG GGC GAC CTG GAG TTC ACT GAG AAC TAT GAC TTC AAG GCT 1974 

V F I ILLLAYVILTYILLLHM 678 
GTC TTC ATC ATC CTG CTG CTG GCC TAT GTA ATT CTC ACC TAC ATC CTC CTG CTC AAC ATG 2034 

LIALMGETVNKIAQESK NIW698 
CTC ATC GCC CTC ATG GGT GAG ACT GTC AAC AAG ATC GCA CAG GAG AGC AAG AAC ATC TGG 2094 

KLQRAITILDTEKSFLKCMR718 
AAG CTG CAG AGA GCC ATC ACC ATC CTG GAC ACG GAG AAG AGC TTC CTT AAG TGC ATG AGG 2154 
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KAFRSGKLLQVGYTPDGKDD 738 
AAG GCC TTC CGC TCA GGC AAG CTG CTG CAG GTG GGG TAC ACA CCT GAT GGC AAG GAC GAC 2214 

YRWCFRVDEVNWTTWNTNVG 758 
TAC CGG TGG TGC TTC AGG GTG GAC GAG GTG AAC TGG ACC ACC TGG AAC ACC AAC GTG GGC 2274 

IINEDPGNCEGVKRTLSFSL 778 
ATC ATC AAC GAA GAC CCG GGC AAC TGT GAG GGC GTC AAG CGC ACC CTG AGC TTC TCC CTG 2334 

RS SRVSGRHWKNFALVPLLR 798 
CGG TCA AGC AGA GTT TCA GGC AGA CAC TGG AAG AAC TTT GCC CTG GTC CCC CTT TTA AGA 2394 

EASARDRQSAQPEEVYLRQF818 
GAG GCA AGT GCT CGA GAT AGG CAG TCT GCT CAG CCC GAG GAA GTT TAT CTG CGA CAG TTT 2454 

SGSLKPEDAEVFKSPAASGE 838 
TCA GGG TCT CTG AAG CCA GAG GAC GCT GAG GTC TTC AAG AGT CCT GCC GCT TCC GGG GAG 2514 

K * 840 
AAG TGA 2520 

GGACGTCACGCAGACAGCACTGTCAACACTGGGCCTTAGGAGACCCCGTTGCCACGGGGGGCTGCTGAGGGAACACCAG 

TGCTCTGTCAGCAGCCTGGCCTGGTCTGTGCCTGCCCAGCATGTTCCCAAATCTGTGCTGGACAAGCTGTGGGAAGCGT 

TCTTGGAAGCATGGGGAGTGATGTACATCCAACCGTCACTGTCCCCAAGTGAATCTCCTAACAGACTTTCAGGTTTTTA 

CTCACTTTACTAAAAAAAAAAAAAAAAGGGCGGCCGCTTA 
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Full-length human VR2 

Input file Flh21ell.seq; Output File Flh21ell.tra 
Sequence length 2809 

GGCTAGCCTGTCCTGACAGGGGAGAGTTAAGCTCCC6TTCTCCACCGTGCCGGCTGGCCAGGTGGGCTGAGGGTGACCG 

AGAGACCAGAACCTGCTTGCTGGAGCTTAGTGCTCAGAGCTGGGGAGGGAGGTTCCGCCGCTCCTCTGCTGTCAGCGCC 

GGCAGCCCCTCCCGGCTTCACTTCCTCCCGCAGCCCCTGCTACTGAGAAGCTCCGGGATCCCAGCAGCCGCCACGCCCT 

GGCCTCAGCCTGCGGGGCTCCAGTCAGGCCAACACCGACGCGCAGCTGGGAGGAAGACAGGACCCTTGACATCTCCATC 

MTSPSSSP 8 
TGCACAGAGGTCCTGGCTGGACCGAGCAGCCTCCTCCTCCTAGG ATG ACC TCA CCC TCC AGC TCT CCA 24 

VFRLETLDGGQEDGSEADRG 28 
GTT TTC AGG TTG GAG ACA TTA GAT GGA GGC CAA GAA GAT GGC TCT GAG GCG GAC AGA GGA 84 

KLDFGSGLPPMESQFQGEDR 48 
AAG CTG GAT TTT GGG AGC GGG CTG CCT CCC ATG GAG TCA CAG TTC CAG GGC GAG GAC CGG 144 

K FA P Q I R VNLNYRK GTGAS Q 68 
AAA TTC GCC CCT CAG ATA AGA GTC AAC CTC AAC TAC CGA AAG GGA ACA GGT GCC AGT CAG 204 

PDPNRFDRDRLFNAVSRGVP 88 
CCG GAT CCA AAC CGA TTT GAC CGA GAT CGG CTC TTC AAT GCG GTC TCC CGG GST GTC CCC 264 

EDLAGLPEYLSRTS RYLTDS 108 
GAG GAT CTG GCT GGA CTT CCA GAG TAC CTG AGC AAG ACC AGC AAG TAC CTC ACC GAC TCG 324 

EYTEGSTGKTCLMKAVLNLK128 
GAA TAC ACA GAG GGC TCC ACA GGT AAG ACG TGC CTG ATG AAG GCT GTG CTG AAC CTT AAG 384 

DGVNAC ILPLLQIDRDSGNP 148 
GAC GGA GTC AAT GCC TGC ATT CTG CCA CTG CTG CAG ATC GAC AGG GAC TCT GGC AAT CCT 444 

QPLVNAQCTDDYYRGHSALH168 
CAG CCC CTG GTA AAT GCC CAG TGC ACA GAT GAC TAT TAC CGA GGC CAC AGC GCT CTG CAC 504 

IAIEKRSLQCVKLLVENGAN188 
ATC GCC ATT GAG AAG AGG AGT CTG CAG TGT GTG AAG CTC CTG GTG GAG AAT GGG GCC AAT 564 

VHARACGRFFQK GQGTCFYF208 
GTG CAT GCC CGG GCC TGC GGC CGC TTC TTC CAG AAG GGC CAA GGG ACT TGC TTT TAT TTC 624 

GELPLSLAACTKQWDVVSYL228 
GGT GAG CTA CCC CTC TCT TTG GCC GCT TGC ACC AAG CAG TGG GAT GTG GTA AGC TAC CTC 684 
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LENPHQPASLQATDSQGNTV 248 
CTG GAG AAC CCA CAC CAG CCC GCC AGC CTG CAG GCC ACT GAC TCC CAG GGC AAC ACA GTC 744 

LHALVMISDNSAENIALVTS 268 
CTG CAT GCC CTA GTG ATG ATC TCG GAC AAC TCA GCT GAG AAC ATT GCA CTG GTG ACC ACC 804 

MYDGLLQAGARLCPTVQLED 288 
ATG TAT GAT GGG CTC CTC CAA GCT GGG GCC CGC CTC TGC CCT ACC GTG CAG CTT GAG GAC 864 

IRNLQDLTPLKLAAKEGKIE 308 
ATC GCC AAC CTG CAG GAT CTC ACG CCT CTG AAG CTG GCC GCC AAG GAG GGC AAG ATC GAG 924 

IFRHILQREFSGLSHLSRKF 328 
ATT TTC AGG CAC ATC CTG CAG CGG GAG TTT TCA GGA CTG AGC CAC CTT TCC CGA AAG TTC 984 

TEWCYGPVRVSLYDLASVDS 348 
ACC GAG TGG TGC TAT GGG CCT GTC CGG GTG TCG CTG TAT GAC CTG GCT TCT GTG GAC AGC 1044 

CEENSVLEIIAFHCKSPHRH 368 
TGT GAG GAG AAC TCA GTG CTG GAG ATC ATT GCC TTT CAT TGC AAG AGC CCG CAC CGA CAC 1104 

RMVVLEPLNKLLQAKWDLLI 388 
CGA ATG GTC GTT TTG GAG CCC CTG AAC AAA CTG CTG CAG GCG AAA TGG GAT CTG CTC ATC 1164 

PKFFLNFLCNLIYMFIFTAV 408 
CCC AAG TTC TTC TTA AAC TTC CTG TGT AAT CTG ATC TAC ATG TTC ATC TTC ACC GCT GTT 1224 

AYHQPTLKKQAAPHLKAEVG 428 
GCC TAC CAT CAG CCT ACC CTG AAG AAG CAG GCC GCC CCT CAC CTG AAA GCG GAG GTT GGA 1284 

NSMLLTGHILILLGGIYLLV 448 
AAC TCC ATG CTG CTG ACG GGC CAC ATC CTT ATC CTG CTA GGG GGG ATC TAC CTC CTC GTG 1344 

GQLWYFWRRHVFIWISFIDS 468 
GGC CAG CTG TGG TAC TTC TGG CGG CGC CAC GTG TTC ATC TGG ATC TCG TTC ATA GAC AGC 1404 

YFEILFLFQALLTVVSQVLC 488 
TAC TTT GAA ATC CTC TTC CTG TTC CAG GCC CTG CTC ACA GTG GTG TCC CAG GTG CTG TGT 1464 

FLAIEWYLPLLVSALVLGWL 508 
TTC CTG GCC ATC GAG TGG TAC CTG CCC CTG CTT GTG TCT GCG CTG GTG CTG GGC TGG CTG 1524 

NLLYYTRGFQHTGIYSVMIQ 528 
AAC CTG CTT TAC TAT ACA CGT GGC TTC CAG CAC ACA GGC ATC TAC AGT GTC ATG ATC CAG 1584 
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KVILRDLLRFLLIYLVFLFG548 
AAG GTC ATC CTG CGG GAC CTG CTG CGC TTC CTT CTG ATC TAC TTA GTC TTC CTT TTC GGC 1644 

FAVALVSLSQEAWRPEAPTG 568 
TTC GCT GTA GCC CTG GTG AGC CTG AGC CAG GAG GCT TGG CGC CCC GAA GCT CCT ACA GGC 1704 

PNATESVQPMEGQEDEGNGA 588 
CCC AAT GCC ACA GAG TCA GTG CAG CCC ATG GAG GGA CAG GAG GAC GAG GGC AAC GGG GCC 1764 

QYRGILEASLELFKFTIGMG608 
CAG TAC AGG GGT ATC CTG GAA GCC TCC TTG GAG CTC TTC AAA TTC ACC ATC GGC ATG GGC 1824 

ELAFQEQLHFRGMVLLLLLA628 
GAG CTG GCC TTC CAG GAG CAG CTG CAC TTC CGC GGC ATG GTG CTG CTG CTG CTG CTG GCC 1884 

YVLLTYILLLNMLIALMSET648 
TAC GTG CTG CTC ACC TAC ATC CTG CTG CTC AAC ATG CTC ATC GCC CTC ATG AGC GAG ACC 1944 

VNSVATDSWSIWKLQKAISV 668 
GTC AAC AGT GTC GCC ACT GAC AGC TGG AGC ATC TGG AAG CTG CAG AAA GCC ATC TCT GTC 2004 

LEMENGYWWCRKKQRAGVML688 
CTG GAG ATG GAG AAT GGC TAT TGG TGG TGC AGG AAG AAG CAG CGG GCA GGT GTG ATG CTG 2064 

TVGTKPDGSPDERWCFRVEE 708 
ACC GTT GGC ACT AAG CCA GAT GGC AGC CCG GAT GAG CGC TGG TGC TTC AGG GTG GAG GAG 2124 

VNWASWEQTLPTLCEDPSGA 728 
GTG AAC TGG GCT TCA TGG GAG CAG ACG CTG CCT ACG CTG TGT GAG GAC CCG TCA GGG GCA 2184 

GVPRTLENPVLASP P K E D E D 748 
GGT GTC CCT CGA ACT CTC GAG AAC CCT GTC CTG GCT TCC CCT CCC AAG GAG GAT GAG GAT 2244 

GASEENYVPVQLLQSN* 765 
GGT GCC TCT GAG GAA AAC TAT GTG CCC GTC CAG CTC CTC CAG TCC AAC TGA 2295 

TGGCCCAGATGCAGCAGGAGGCCAGAGGACAGAGCAGAGGATCTTTCCAACCACATCTGCTGGCTCTGGGGTCCCAGTG 
AATTCTGGTGGCAAATATATATTTTCACTAACTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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Partial human VR2 alternate form 

Input file frhobl2c4.seg; Output File frhobl2c4.tra 
Sequence length 1489 

GRFFQKGQGTCFYFGELPL 19 
GC GGC CGC TTC TTC CAG AAG GGC CAA GGG ACT TGC TTT TAT TTC GGT GAG CTA CCC CTC 57 



s 

TCT 


L 
TTG 


A A C T K 
GCC GCT TGC ACC AAG 


Q 
CAG 


W 

TGG 


D 
GAT 


V V 
GTG GTA 


S 
AGC 


Y 
TAC 


L 
CTC 


L 
CTG 


E N P H 
GAG AAC CCA CAC 


39 
117 


Q 

CAG 


P 
CCC 


A S L Q A 
GCC AGC CTG CAG GCC 


T 
ACT 


D 
GAC 


S 
TCC 


Q 6 
CAG GGC 


N 
AAC 


T 
ACA 


V 
GTC 


L 
CTG 


H A L V 
CAT GCC CTA GTG 


59 
177 


M 

ATG 


I 

ATC 


S D N S A 
TCG GAC AAC TCA GCT 


E 
GAG 


N 
AAC 


I 
ATT 


A L 
GCA CTG 


V 
GTG 


T 

ACC 


S 
AGC 


H 
ATG 


Y D G L 
TAT GAT GGG CTC 


79 
237 


L 

CTC 


Q 
CAA 


A G A R L 
GCT GGG GCC CGC CTC 


C 
TGC 


P 
CCT 


T 
ACC 


V Q 
GTG CAG 


L 
CTT 


E 
GAG 


D 
GAC 


I 
ATC 


R N L Q 
CGC AAC CTG CAG 


99 
297 


D 

GAT 


L 
CTC 


T P L K L 
ACG CCT CTG AAG CTG 


A 

GCC 


A 
GCC 


K 
AAG 


E G 
GAG GGC 


K 
AAG 


I 
ATC 


E 
GAG 


I 
ATT 


F R H I 
TTC AGG CAC ATC 


119 
357 


L 

CTG 


Q 
CAG 


R E F S G 
CGG GAG TTT TCA GGA 


L 
CTG 


S 
AGC 


H 
CAC 


L S 
CTT TCC 


R 
CGA 


K 
AAG 


F 
TTC 


T 
ACC 


E W C Y 
GAG TGG TGC TAT 


139 
417 


G 

GGG 


P 
CCT 


V R V S L 
GTC CGG GTG TCG CTG 


Y 

TAT 


D 
GAC 


L 
CTG 


A S 
GCT TCT 


V 
GTG 


D 
GAC 


S 
AGC 


C 
TGT 


E E N S 
GAG GAG AAC TCA 


159 
477 


V 

GTG 


L 
CTG 


E I I A F 
GAG ATC ATT GCC TTT 


H 
CAT 


C 

TGC 


K 
AAG 


S P 
AGC CCG 


H 
CAC 


R 
CGA 


H 
CAC 


R 
CGA 


M V V L 
ATG GTC GTT TTG 


1 *7 ft 

179 
537 


E 

GAG 


P 
CCC 


L N K L L 
CTG AAC AAA CTG CTG 


Q 
CAG 


A 
GCG 


K 
AAA 


W D 
TGG GAT 


L 
CTG 


L 
CTC 


I 

ATC 


P 

CCC 


K F F L 
AAG TTC TTC TTA 


199 
597 


N 
AAC 


F 
TTC 


L C N L I 
CTG TGT AAT CTG ATC 


Y 
TAC 


M 
ATG 


F 
TTC 


I F 
ATC TTC 


T 
ACC 


A 

GCT 


V 

GTT 


A 

GCC 


Y H Q P 
TAC CAT CAG CCT 
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FLFQALLTVVSQVLCFLAIE299 
TTC CTG TTC CAG GCC CTG CTC ACA GTG GTG TCC CAG GTG CTG TGT TTC CTG GCC ATC GAG 897 

WYLPLLVSALVLGWLNLLYY 319 
TGG TAC CTG CCC CTG CTT GTG TCT GCG CTG GTG CTG GGC TGG CTG AAC CTG CTT TAC TAT 957 

TRGFQHTGIYSVMIQKKAIS339 
ACA CGT GGC TTC CAG CAC ACA GGC ATC TAC AGT GTC ATG ATC CAG AAG AAA GCC ATC TCT 1017 

VLEMENGYWWCRKKQRAGVM359 
GTC CTG GAG ATG GAG AAT GGC TAT TGG TGG TGC AGG AAG AAG CAG CGG GCA GGT GTG ATG 1077 

LTVGTKPDGSPDERWCFRVE 379 
CTG ACC GTT GGC ACT AAG CCA GAT GGC AGC CCG GAT GAG CGC TGG TGC TTC AGG GTG GAG 1137 

EVNWASWEQTLPTLCEDPSG 399 
GAG GTG AAC TGG GCT TCA TGG GAG CAG ACG CTG CCT ACG CTG TGT GAG GAC CCG TCA GGG 1197 

AGVPRTLENPVLASPPKEDE419 
GCA GGT GTC CCT CGA ACT CTC GAG AAC CCT GTC CTG GCT TCC CCT CCC AAG GAG GAT GAG 1257 

DGASEENYVPVQLLQSN* 437 
GAT GGT GCC TCT GAG GAA AAC TAT GTG CCC GTC CAG CTC CTC CAG TCC AAC TGA 1311 

TGGCCCAGATGCAGCAGGAGGCCAGAGGACAGAGCAGAGGATCTTTCCAACCACATCTGCTGGCTCTGGGGTCCCAGTG 

AATTCTGGTGGCAAATATATATTTTCACTAACTCAAAAAAA 

GCGGACGCGTGGGTCGAC 
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Partial rat VR2 

Input file Flrxbl47gll.seq; Output File Flrxbl47gll.tra 
Sequence length 1794 

STHASALSLAACTKQWDVV 19 

G TCG ACC CAC GCG TCC GCT CTT TCT CTG GCT GCG TGC ACC AAG CAG TGG GAT GTG GTG 57 

TYLLENPHO.PASLEATDS L G 39 

ACC TAC CTC CTG GAG AAC CCA CAC CAG CCG GCC AGC CTG GAG GCC ACC GAC TCC CTG GGC 117 

NTVLHALVMIADNSPENSAL 59 

AAC ACA GTC CTG CAT GCT CTG GTA ATG ATT GCA GAT AAC TCG CCT GAG AAC AGT GCC CTG 177 

VIHMYDGLLQMGARLCPTVQ 79 

GTG ATC CAC ATG TAC GAC GGG CTT CTA CAA ATG GGG GCG CGC CTC TGC CCC ACT GTG CAG 237 

LEEISNHQGLTPLKLAAKEG 99 

CTT GAG GAA ATC TCC AAC CAC CAA GGC CTC ACA CCC CTG AAA CTA GCC GCC AAG GAA GGC 297 

KIEIFRHILQREFSGPYQPL 119 

AAA ATC GAG ATT TTC AGG CAC ATT CTG CAG CGG GAA TTC TCA GGA CCG TAC CAG CCC CTT 357 

SRKFTEWCYGPVRVSLYDLS 139 

TCC CGA AAG TTT ACT GAG TGG TGT TAC GGT CCT GTG CGG GTA TCG CTG TAC GAC CTG TCC 417 

SVDSWEKNSVLEIIAFHCKS 159 

TCT GTG GAC AGC TGG GAA AAG AAC TCG GTG CTG GAG ATC ATC GCT TTT CAT TGC AAG AGC 477 

PNRHRMVVLEPLNKLLQEKW 179 

CCG AAC CGG CAC CGC ATG GTG GTT TTA GAA CCA CTG AAC AAG CTT CTG CAG GAG AAA TGG 537 

DRLVSRPFFNFACYLVYHFI 199 

GAT CGG CTC GTC TCA AGA TTC TTC TTC AAC TTC GCC TGC TAC TTG GTC TAC ATG TTC ATC 597 

FTVVAYHQPSLDQPAIPSSK 219 

TTC ACC GTC GTT GCC TAC CAC CAG CCT TCC CTG GAT CAG CCA GCC ATC CCC TCA TCA AAA 657 

ATFGESMLLLGHILILLGGI 239 

GCG ACT TTT GGG GAA TCC ATG CTG CTG CTG GGC CAC ATT CTG ATC CTG CTT GGG GGT ATT 717 

YLLLGQLWYFWRRRLFIWI S 259 

TAC CTC TTA CTG GGC CAG CTG TGG TAC TTT TGG CGG CGG CGC CTG TTT ATC TGG ATC TCA 777 

FMDSYFEILFLLQALLTVLS 279 

TTC ATG GAC AGC TAC TTT GAA ATC CTC TTT CTC CTT CAG GCT CTG CTC ACA GTG CTG TCC 837 
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QVLRFMETEWYLPLLVLSLV 299 

CAG GTG CTG CGC TTC ATG GAG ACT GAA TGG TAC CTA CCC CTG CTA GTG TTA TCC CTA GTG 897 

LGWLNLLYYTRGFQHTGIYS 319 

CTG GGC TGG CTG AAC CTG CTT TAC TAC ACA CGG GGC TTT CAG CAC ACA GGC ATC TAC AGT 957 

VMIQKVILRDLLRFLLVYLV 339 

GTC ATG ATC CAG AAG GTC ATC CTT CGA GAC CTG CTC CGT TTC CTG CTG GTC TAC CTG GTC 1017 

FLFGFAVALVSLSREARSPK 359 

TTC CTT TTC GGC TTT GCT GTA GCC CTA GTA AGC TTG AGC AGA GAG GCC CGA AGT CCC AAA 1077 

APEDNNSTVTEQPTVGQEEE 379 

GCC CCT GAA GAT AAC AAC TCC ACA GTG ACG GAA CAG CCC ACG GTG GGC CAG GAG GAG GAG 1137 

PAPYRSILDASLELFKFTIG 399 

CCA GCT CCA TAT CGG AGC ATT CTG GAT GCC TCC CTA GAG CTG TTC AAG TTC ACC ATT GGT 1197 

MGELAFQEQLRFRGVVLLLL 419 

ATG GGG GAG CTG GCT TTC CAG GAA CAG CTG CGT TTT CGT GGG GTG GTC CTG CTG TTG CTG 1257 

LAYVLLTYVLLLNMLIALMS 439 

TTG GCC TAC GTC CTT CTC ACC TAC GTC CTG CTG CTC AAC ATG CTC ATT GCT CTC ATG AGC 1317 

ETVNHVADNSWSIWKLQKAI 459 

GAA ACT GTC AAC CAC GTT GCT GAC AAC AGC TGG AGC ATC TGG AAG TTG CAG AAA GCC ATC 1377 

SVLEMENGYWWCRRKKHREG 479 

TCT GTC TTG GAG ATG GAG AAT GGT TAC TGG TGG TGC CGG AGG AAG AAA CAT CGT GAA GGG 1437 

RLLKVGTRGDGTPDERWCFR 499 

AGG CTG CTG AAA GTC GGC ACC AGG GGG GAT GGT ACC CCT GAT GAG CGC TGG TGC TTC AGG 1497 

VEEVNWAAWEKTLPTLSEDP 519 

GTG GAG GAA GTA AAT TGG GCT GCT TGG GAG AAG ACT CTT CCC ACC TTA TCT GAG GAT CCA 1557 

SGPGITGNKKNPTSKPGKNS 539 

TCA GGG CCA GGC ATC ACT GGT AAT AAA AAG AAC CCA ACC TCT AAA CCG GGG AAG AAC AGT 1617 

ASEEDHLPLQVLQSP* 555 

GCC TCA GAG GAA GAC CAT CTG CCC CTT CAG GTC CTC CAG TCC CCC TGA 1665 

TGGCCCAGATGCAGCAGCAGGCTGGCAGGATGGAGTAGGGAATCTTCCCAGCCACACCAGAGGCTACTGAATTTTGGTG 
GAAATATAAATATTTTTTTTGCATAAAAAAAAAAAAAAAGGGCGGCCGC 
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GAP of: humanvr2.pep check: 5746 from: 1 to: 764 
humanVR2 Flh21ell 

to: humanvrl.pep check: 6877 from: 1 to: 839 
hmnanVRl _Fbhl8547pat - fchrb87a6, 3909 bases, 4554 checksum. 

Symbol comparison table: 

/ddm_local/gcg/gcg_9 • l/gcgcore/data/rundata/blosum62 . cmp 
CompCheck: 6430 

Gap Weight: 12 Average Match: 2.912 
Length Weight: 4 Average Mismatch: -2.003 

Quality: 1530 Length: 850 

Ratio: 2.003 Gaps: 10 

Percent Similarity: JS5.378 Percent Identity: 46.348 

Match display thresholds for the alignment (s ) : 
| = IDENTITY 
: = 2 
. = 1 

humanvr2.pep x humanvrl.pep 



1 MTSPSSSPVF 10 

|. I . .| 

1 MKKWSSTDLGTAADPLQKDTCPDPLDGDPNSRPPPAKPQLPTAKSRTRLF 50 

• . • * • 

11 RLETLDGGQEDGSEADRGKLDFGSGLPPMESQFQGEDRKFAPQIRVNLNY 60 

: |.|| : . | |. 

51 GKGDSEEAFPVDCPHEEGELDSCPTI . TVSPVITIQRPGD6PT6ARLLSQ 99 

• • « • . 

61 RKGTGASQPDPNRFDRDRLFNAVSRGVPEDLAGLPEYLSKTSKYLTDSEY 110 
:|| :| ||.. :|| | :| |. |:|||.|: 
100 DSVAASTEKTLRLYDRRSIFEAVAQNNCQDLESLLLFLQKSKKHLTDNEF 149 

• * • • * 

111 TEGSTGKTCLMKAVLNLKDGVNACILPLLQIDRDSGNPQPLVNAQCTDDY 160 

: lllllhll-lll II I I IN I • • • I I I I II I 

150 KDPETGKTCLLKAMLNLHDGQNTTIPLLLEIARQTDSLKELVNASYTDSY 199 

• • • • • 

161 YRGHS ALHI AIEKRSLQCVKLLVENGANVHARACGRFFQKGQG . TCFYFG 209 

hi •Mllllhl-: I MINIM I I I IM -I Ml 

200 YKGQTALHIAIERRNMALVTLLVENGADVQAAAHGDFFKKTKGRPGFYFG 249 

Fig. 5 
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• • • • • 

210 ELPLSLAACTKQWDWSYLLENPHQPASLQATDSQGNTVLHALVMISDNS 259 

IMIIIIIII I :| :|hl I I : I || lllllllll MM 
250 ELPLSLAACTNQLGIVKFLLQNSWQTADISARDSVGNTVLHALVEVADNT 299 

• • ♦ * • 

260 AENIALVTSMYDGLLQAGARLCPTVQLEDIRNLQDLTPLKLAAKEGKIEI 309 

Ml Mill. :| Ihl MMM I • :||| III III ■ 
300 ADNTKFVTSMYNEILMLGAKLHPTLKLEELTNKKGMTPLALAAGTGKIGV 349 

310 FRHILQREFS . . GLSHLSRKFTEWCYGPVRVSLYDLASVDSCEENSVLEI 357 

, BA : l 1 1 II lllllllll Nil Mill. HM-llllh 

350 LAYILQREIQEPECRHLSRKFTEWAYGPVHSSLYDLSCIDTCEKNSVLEV 399 

• • • • • 

358 IAF . HCKSPHRHRMVVLEPLNKLLQAKWDLLIPK . FFLNFLCNLIYMFIF 405 

IM ..|.|| |...||l|:||| III : : h Ml :|| II 
400 IAYSSSETPNRHDMLLVEPLNRLLQDKWDRFVKRIFYFNFLVYCLYMIIF 449 

• • • • • 

406 TAVAYHQPTLKKQAAPHLKAEVGNSMLLTGHILILLGGIYLLVGQLWYFW 455 

4Bn I IMI I M- -II II .MM « II 

450 TMAAYYRPV. . DGLPPFKMEKIGDYFRVTGEILSVLGGVYFFFRGIQYFL 497 

• • • • • 

456 RRHVF1WISFIDSYFEILFLFQALLTWSQVLCFLAIEWYLPLLVSALVL 505 

•I • Mill Ml M .-III |. :| -I I 

498 QRRPSKKTLFVDSYSEMLFFLQSLFMIATVVLYFSHLKEYVASMVFSLAL 547 

• • • • • 

506 GWLNLLYYTRGFQHTGIYSVMIQKVILRDLLRFLLIYLVFLFGFAVALVS 555 

CAO M MINIMI I I I - I M : M I I I 1 I ||: :|:||||l|. M- 
548 GVJTNMLYYTRGFQQMGIYAVMIEKMILRDLCRFMFVYIVFLFGFSTAWT 597 

556 LSQEAWRPEAPTGPNATESVQPMEGQEDEGNGAQYRGILEASLELFKFTI 605 

- ' :! I- • I I - I : MINIM 
598 LIEDGKNDSLPSESTSHRWRGPACRPPD S S YNSLYSTCLELFKFTI 643 

• • • • • 

606 GMGELAFQEQLHFRGMVLLLLLAYVLLTYILLLNMLIALMSETVNSVATD 655 

c MNI M N • ::MMM:MIMMMMMI MM N : 
644 GMGDLEFTENYDFKAVFIILLLAYVILTYILLLNMLIALMGETVNKIAQE 693 

• • • • • 

656 SWSIWKLQKAISVLEMENGYWWC . RKKQRA6VMLTVGTKPDGSPDERWCF 704 

_ I -11111:11.:!: I * I M M '\ II III I MM 
694 SKNIWKLQRAITILDTEKSFLKCMRKAFRSGKLLQVGYTPDGKDDYRWCF 743 

• * • • • 

705 RVEEVNWASWEQTLPTLCEDPSGA.GVPRTLENPVLASPPKEDEDGASEE 753 

nAA ll-tlll -I • : III II III • -I I • 

744 RVDEVNWTTWNTNVGIINEDPGNCEGVKRTLSFSLRSS RVSGRHWK 789 

• • • • • 

754 NYVPVQLLQSN 764 

h I IN 

790 NFALVPLLREASARDRQSAQPEEVYLRQFSGSLKPEDAEVFKSPAASGEK 839 
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GAP of: humanvr2 . seq check: 8853 from: 1 to: 2809 
humanVR2 21611a, 2809 bases, 8853 checksum. 

to: humanvrl.seq check: 4554 from: 1 to: 3909 

humanVRl Fbhl8547pat - Import - complete 

Symbol comparison table: 
/ddm_local/gcg/gcg_9.1/gcgcore/data/rundata/nwsgapdna.cmp 
CompCheck: 8760 

Gap Weight: 50 Average Match: 10,000 

Length Weight: 3 Average Mismatch: 0.000 

Quality: 14359 Length: 3934 

Ratio: 5.112 Gaps: 15 

Percent Similarity: 55.316 Percent Identity: 55.316 

Match display thresholds for the alignment (s) : 
| = IDENTITY 
: = 5 
. = 1 

humanvr2.seq x humanvrl.seq 



1 GGCTAGCCTGTCCTGACAGGGGAGAG 26 

I I I I I II III 

801 TGTCCACAGTAGTCCCCCCTTATCCACGGGTGTCACTTTCCATGGGTTCA 850 
. . . • • 

27 TTAAGCTCCCGTTCTCCACCGTGCCGGCTGGCCAGGTGGGCTGAGGGTGA 76 

II I I II I I I I II I III II 

851 GTTATTTGCGGTCAACCACGGTCTGCCAATATTAAATGGAAAATTCTTCA 900 

. . . • • 

77 CCGAGAGACCAGAACCTGCTTGCTGGAGCTTAGTGCTCAGAGCTGGGGAG 126 

II III I I I II I I II III I I II 

901 AACAGTTCCCAAGTTTTCCCTTGTGCATTGTTCTGAGCAGTGTGATGAAG 950 

. » * • • 

127 GGAGGTTCCGCCGCTCCTCTGCTGTCAGCGCCGGCAGCCCCTCCCGGCTT 176 

I I I II I II I I III I I 

951 AGTCTCTGCCGTGCCATCTGGGATGCAAACCGTCCCTGTGTCCCCCACGT 1000 
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• • • • • 

177 CACTTCCTCCCGCAGCCCCTGCTACTGAGAAGCTCCGGGATCCCAGCAGC 226 

lftft1 I II Ml II Ml III II I I I 

1001 CCAGGCCGTAGATGCTCCCCGCCGGTCAGTCACTTAGTCGTCAGATCGCC 1050 

• • • • 

227 CGCCACGCCCTGGC CTCAGCCTGCGGG 253 

II I I II II III I I 

1051 CGTCCTGGTATCACAGTGCTTCTGTTCAGGTTGCACACTGGGCCACAGAG 1100 

• • . . . 
254 GCTCCAGTCAGGCCAACACCGACGCGCAGCTGGGAGGAAG 293 

„„, i urn in i i inn ii i 

1101 GATCCAGCAAGGATGAAGAAAT6GAGCAGCACAGACTTGGGGACAGCTGC 1150 

• • • . • 

294 ACAGGACCCTTGACATCTCCATCTGCACAGAGGTCCTG 331 

i urn i H i N in i in in 

1151 GGACCCACTCCAAAAGGACACCTGCCCAGACCCCCTGGATGGAGACCCTA 1200 

• • . . . 
332 GCTGGACCGAGCAGCCTCCTCCTCCTAGGATGACCTCACCCTCCAGC . . T 379 

„ ftt HI I H I M I I II I II III 

1201 ACTCCAGGCCACCTCCAGCCAAGCCCCAGCTCCCCACGGCCAAGAGCCGC 1250 

380 CTCCAGTTTTCAGGTTGGAGACATTAGATGGAGGCCAAGAAGATGGCTCT 429 

JJU 1111 ii iii nun i m i 

1251 ACCCGGCTCTTTGGGAAGGGTGACTCGGAGGAGGCTTTCCCGGTGGATTG 1300 

• • . 

430 GAGGCGGACAGAGGAAAGCTGGATTTTGGGAGCGGGCTGCCTCCCATGGA 479 

I III I I I I II I I I 

1301 CCCCCACGAGGAAGGTGAGTTGGACTCCTGCCCGACCATCACAGTCAGCC 1350 

480 GTCACAGTTCC AGGGCGAGGACCGGAAATTCGCCCCTCAGATAAGAGTC A 529 

„ B1 I I HI MM I III I III I I I 

1351 CTGTTATCACCATCCAGAGGCCAGGAGACGGCCCCACCGGTGCCAGG. . C 1398 

• • . . 

530 ACCTCAACTACCGAAAGGGAACAGGTGCCAGTCAGCCGGATCCAAACCGA 579 
II I I IN II II I I I | 

1399 TGCTGTCCCAGGACTCTGTCGCCGCCAGCACCGAGAAGACCCTCAGGCTC 1448 

• • • * » 

580 TTTGACCGAGATCGGCTCTTCAATGCGGTCTCCCGGGGTGTCCCCGAGGA 629 

„ Aa I Ml II I INI I II II I I I I I I 1 1 1 1 

1449 TATGATCGCAGGAGTATCTTTGAAGCCGTTGCTCAGAATAACTGCCAGGA 1498 

• . . 

630 TCTGGCTGGACTTCCAGAGTACCTGAGCAAGACCAGCAAGTACCTCACCG 679 

1400 Mill I M I Mill MINI III II Mill I 

1499 TCTGGAGAGCCTGCTGCTCTTCCTGCAGAAGAGCAAGAAGCACCTCACAG 1548 
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680 


ACTCGGAATACACAGAGGGCTCCACAGGTAAGACGTGCCTGATGAAGGCT 

II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


729 


1549 


II II 1 II III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 
ACAACGAGTTCAAAGACCCTGAGACAGGGAAGACCTGTCTGCTGAAAGCC 


1598 


730 


• * • • • 

GTGCTGAACCTTAAGGACGGAGTCAATGCCTGCATTCTGCCACTGCTGCA 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I II II 1 1 1 1 1 MINI 


779 


1599 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 II III 1 
ATGCTCAACCTGCACGACGGACAGAACACCACCATCCCCCTGCTCCTGGA 


1648 


780 


• • • • • 

GATCGACAGGGACTCTGGCAATCCTCAGCCCCTGGTAAATGCCCAGTGCA 

1 1 1 1 1 III till 1 II 1 1 1 1 1 1 1 1 1 III 


829 


1649 


1 1 1 1 1 III 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 
GATCGCGCGGCAAACGGACAGCCTGAAGGAGCTTGTCAACGCCAGCTACA 


1698 


830 


• • • • • 

CAGATGACTATTACCGAGGCCACAGCGCTCTGCACATCGCCATTGAGAAG 

1 II III III 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


879 


1699 


1 II III III 1 1 1 1 1 1 II 1 II II 1 II 1 1 1 1 II 1 1 1 1 
CGGACAGCTACTACAAGGGCCAGACAGCACTGCACATCGCCATCGAGAGA 


1748 


880 


• • • • • 

AGGAGTCTGCAGTGTGTGAAGCTCCTGGTGGAGAATGGGGCCAATGTGCA 
II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


929 


1749 


II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
CGCAACATGGCCCTGGTGACCCTCCTGGTGGAGAACGGAGCAGACGTCCA 


1798 


930 


• • • • • 

TGCCCGGGCCTGCGGCCGCTTCTTCCAGAAGGGCCAAG. . . GGACTTGCT 

II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


976 


1799 


II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
GGCTGCGGCCCATGGGGACTTCTTTAAGAAAACCAAAGGGCGGCCTGGAT 


1848 


977 


• • • • • 

TTTATTTCGGTGAGCTACCCCTCTCTTTGGCCGCTTGCACCAAGCAGTGG 

1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III 1 


1026 


1849 


1 II 1 1 1 1 II 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III 1 
TCTACTTCGGTGAACTGCCCCTGTCCCTGGCCGCGTGCACCAACCAGCTG 


1898 


1027 


• • • • • 

GATGTGGTAAGCTACCTCCTGGAGAACCCACACCAGCCCGCCAGCCTGCA 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 


1076 


1899 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 
GGCATCGTGAAGTTCCTGCTGCAGAACTCCTGGCAGACGGCCGACATCAG 


1948 


1077 


• • • • • 

GGCCACTGACTCCCAGGGCAACACAGTCCTGCATGCCCTAGTGATGATCT 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 III 1 1 


1126 


1949 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 III 1 1 
CGCCAGGGACTCGGTGGGCAACACGGTGCTGCACGCCCTGGTGGAGGTGG 


1998 


1127 


• • * • • 

CGGACAACTCAGCTGAGAACATTGCACTGGTGACCAGCATGTATGATGGG 


1176 


•I ft ft ft 

1999 


1 llllll 1 II II INI 1 Mill Mill!!! Ill 1 

CCGACAACACGGCCGACAACACGAAGTTTGTGACGAGCATGTACAATGAG 


2048 


1177 


• ♦ • • • 
CTCCTCCAAGCTGGGGCCCGCCTCTGCCCTACCGTGCAGCTTGAGGACAT 


1226 


2049 


i ii iiiiii ii iii ii ii mi mil i 

ATTCTGATGCTGGGGGCCAAACTGCACCCGACGCTGAAGCTGGAGGAGCT 


2098 
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• ♦ • • • 

1227 CCGCAACCTGCAGGATCTCACGCCTCTGAAGCTGGCCGCCAAGGAG6GCA 1276 

_ I MM I Ml I MMI Ml Mill II I II I 

2099 CACCAACAAGAAGGGAATGACGCCGCTGGCTCTGGCAGCTGGGACCGGGA 2148 

• • • • • 

1277 AGATCGAGATTTTCAGGCACATCCTGCAGCGGGAGTT TTCAGGA 1320 

_ MUM I I II I II II I M ! 1 1 1 1 1 I I I 

2149 AGATCGGGGTCTTGGCCTATATTCTCCAGCGGGAGATCCAGGAGCCCGAG 2198 

• • • « • 
1321 CTGAGCCACCTTTCCCGAAAGTTCACCGAGTGGTGCTATGGGCCTGTCCG 1370 

oiqq II Mill III I MMIIIIMIMM III Mill II I 

2199 TGCAGGCACCTGTCCAGGAAGTTCACCGAGTGGGCCTACGGGCCCGTGCA 2248 

• • • • • 
1371 GGTGTCGCTGTATGACCTGGCTTCTGTGGAC AGCTGTGAGGAGAACTC AG 1420 

III II I II llllll I I I fill Ml III 1 1 1 1 1 1 1 I 

2249 CTCCTCGCTGTACGACCTGTCCTGCATCGACACCTGCGAGAAGAACTCGG 2298 

• • • • • 
1421 TGCTGGAGATCATTGCCTTTCATTGCA. . . AGAGCCCGCACCGACACCGA 1467 

„ MINIM I II MM III III III I II III 

2299 TGCTGGAGGTGATCGCCTACAGCAGCAGCGAGACCCCTAATCGCCACGAC 2348 

• • • * • 

1468 ATGGTCGTTTTGGAGCCCCTGAACAAACTGCTGCAGGCGAAATGGGA. . . 1514 

„ Ml II I lllllll llllll III llllll; 1 II Mill 

2349 ATGCTCTTGGTGGAGCCGCTGAACCGACTCCTGCAGGACAAGTGGGACAG 2398 

• • • • ♦ 

1515 TCTGCTCATCCCCAAGTTCTTCTTAAACTTCCTGTGTAATCTGATCTACA 1564 

„ OQ I Ml I II MM III lllllllll I I MM 

2399 ATTCGTCAAGCGCATCTTCTACTTCAACTTCCTGGTCTACTGCCTGTACA 2448 

• • • • • 
1565 TGTTCATCTTCACCGCTGTTGCCTACCATCAGCCTACCCTGAAGAAGCAG 1614 

II MIMIIMM I lllllll I I III II 

2449 TGATCATCTTCACCATGGCTGCCTACTA CAGGCCCGTGGATGGCTT 2494 

• • ♦ • • 

1615 GCCGCCCCTCACCTGAAAGCGGAGGTTGGAAACTCCATGCTGCTGACGGG 1664 

0 „ QC Ml Ml I I III I Mill III I I I II II 

2495 GCCTCCCTTTA. .AGATGGAAAAAATTGGAGACTATTTCCGAGTTACTGG 2542 

• • • • • 

1665 CCACATCCTTATCCTGCTAGGGGGGATCTACCTCCTCGTGGGCCAGCTGT 1714 

I MMI II MM II Mill II I I I II 

2543 AGAGATCCTGTCTGTGTTAGGAGGAGTCTACTTCTTTTTCCGAGGGATTC 2592 

• • « • • 
1715 GGTACTTCTGGCGGCGCCACGTGTTCATCTGGATCTCGTTCATAGACAGC 1764 

III Ml II M I lllllll III I llllll 

2593 AGTATTTCCTGCAGAGGCGGCCGTCGATGAAGACCCTGTTTGTGGACAGC 2642 
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• • • • • 



1765 


TACTTT6AAATCCTCTTCCTGTTCCAGGCCCTGCTCACAGTGGTGTCCCA 


1814 


2643 


111 III II II III 1 1 III 1 III III III II 
TACAGTGAGATGCTTTTCTTTCTGCAGTCACTGTTCATGCTGGCCACCGT 


2692 


1815 


• • • • • 

GGTGCTGTGTTTCCTGGCCATCGAGTGGTACCTGCCCCTGCTTGTGTCTG 


1864 


2693 


Mllllll III 1 II II III III 1 II 1 
GGTGCTGTACTTCAGCCACCTCAAGGAGTATGTGGCTTCCATGGTATTCT 


2742 


1865 


• • • • • 

CGCTGGTGCTGGGCTGGCTGAACCTGCTTTACTATACACGTGGCTTCCAG 


1914 


2743 


1 Mil Mllllll III MM Mill II II II MUM 

CCCTGGCCTTGGGCTGGACCAACATGCTCTACTACACCCGCGGTTTCCAG 


2792 


1915 


• • • • • 

CACACAGGCATCTACAGTGTCATGATCCAGAAGGTCATCCTGCGGGACCT 


1964 


2793 


II 1 MINIM Mllllll Mill 1 MUM 1 Mill 

CAGATGGGCATCTATGCCGTCATGATAGAGAAGATGATCCTGAGAGACCT 


2842 


1965 


• • • • • 

GCTGC6CTTCCTTCTGATCTACTTAGTCTTCCTTTTCGGCTTCGCTGTAG 


2014 


2843 


1 II III 1 1 Mill 1 MUM 1 Mill II 1 II 

GTGCCGTTTCATGTTTGTCTACATCGTCTTCTTGTTCGGGTTTTCCACAG 


2892 


2015 


• • • • • 
CCCTGGTGAGCCTGAGCCAGGAGGCTTGGCGCCCCGAAGCTCCTACAGGC 


2064 


2893 


1 MUM MM 1 II 1 1 1 II 1 1 

CGGTGGTGACGCTGATTGAAGACGGGAAGAATGACTCCCTGCCGTCTGAG 


2942 


2065 


• • • • • 

CCCAATGCCACAGAGTCAGTGCAGCCCATGGAGGGACAGGAGGACGAGGG 


2114 


2943 


III 1 III 1 III 1 1 II 


2980 


2115 


* • • • • 

CAACGGGGCCCAGTACAGGGGTATCCTGGAAGCCTCCTTGGAGCTCTTCA 


2164 


2981 


1 II 1 MM II III 1 lllllll MM 

CCCCGATAGCTCCTACAACAGCCTGTACTCCACCTGCCTGGAGCTGTTCA 


3030 


2165 


• • • • » 

AATTCACCATCGGCATGGGCGAGCTGGCCTTCCAGGAGCAGCTGCACTTC 


2214 




1 IIMIIIIIIIIIIIIIIII MM III III 1 Mill 

AGTTCACCATCGGCATGGGCGACCTGGAGTTCACTGAGAACTATGACTTC 


3080 


2215 


• • » • • 
CGCGGCATGGTGCTGCTGCTGCTGCTGGCCTACGTGCTGCTCACCTACAT 


2264 


3081 


1 1 1 1 1 llllllllllllll II 1 IMIMIMM 

AAGGCTGTCTTCATCATCCTGCTGCTGGCCTATGTAATTCTCACCTACAT 


3130 


2265 


• • • • • 

CCTGCTGCTCAACATGCTCATCGCCCTCATGAGCGAGACCGTCAACAGTG 


2314 


3131 


Ml lllllllllllllllllllllllllll 1 Mill lllllll 

CCTCCTGCTCAACATGCTCATCGCCCTCATGGGTGAGACTGTCAACAAGA 


3180 
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• • • • • 

2315 TCGCCACTGACAGCTGGAGCATCT6GAAGCTGCAGARAGCCATCTCTGTC 2364 

„„, 1 1 1 1 II III II lllllllllllllllll IMIIII I II 

3181 TCGCACAGGAGAGCAAGAACATCTGGAAGCTGCAGAGAGCCATCACCATC 3230 

• • • • • 

2365 CTGGAGATGGAGAATGGCTATTGGTGGTGCAGGAAGAAG. . .CAGCGGGC 2411 

him i mill mi mil n mi ii i 

3231 CTGGACACGGAGAAGAGCTTCCTTAAGTGCATGAGGAAGGCCTTCCGCTC 3280 

• • • • • 

2412 AGGTGTGATGCTGACCGTTGGCACTAAGCCAGATGGCAGCCCGGATGAGC 2461 

in i mil ii ii i ii iiiim ii i i 

3281 AGGCAAGCTGCTGCAGGTGGGGTACACACCTGATGGCAAGGACGACTACC 3330 

• • • • • 

2462 GCTGGTGCTTCAGGGTGGAGGAGGTGAACTGGGCTTCATGGGAGCAGACG 2511 

I lllllllllllllllll MINIMUM I I Ml I I 

3331 GGTGGTGCTTCAGGGTGGACGAGGTGAACTGGACCACCTGGAACACCAAC 3380 

• • • • • 
2512 CTGCCTACGCTGTGTGAGGACCCG. . . TCAGGGGCAGGTGTCCCTCGAAC 2558 

„„ ii i i ii mill i i ii in ii ii 

3381 GTGGGCATCATCAACGAAGACCCGGGCAACTGTGAGGGCGTCAAGCGCAC 3430 

• • • • • 
2559 TCTCGAGAACCCTGTCCTG GCTTCCCCTCCCAAGGAGGATGAGGAT 2604 

II I I I I I II II ll ll 

3431 CCTGAGCTTCTCCCTGCGGTCAAGCAGAGTTTCAGGCAGACACTGGAAGA 3480 

• • • • • 
2605 GGTGCCTCTGAGGAAAACTATGTGCCCGTCCAGCTCCTCCAGTCCAACTG 2654 

, JO , 1 11 111 II I III II II 

3481 ACTTTGCCCTGGTCCCCCTTTTAAGAGAGGCAAGTGCTCGAGATAGGCAG 3530 

• ♦ • • • 
2655 ATGGCCCAGATGCAGCAGGAGGCCAGAGGACAGAGCAGAGGATCTTTCCA 2704 

M IN M I I Mill III III I I 

3531 TCTGCTCAGCCCGAGGAAGTTTATCTGCGACAGTTTTCAGGGTCTCTGAA 3580 

• • • • « 
2705 ACCACATCTGCTGGCTCTGGGGTCCCAGTGAATTCTGGTGGCAAATATAT 2754 

,„ Ml III III MM II I III I 

3581 GCCA GAGGACGCTGAGGTCTTCAAGAGTCCTGCCGCTTCCGGGGA 3625 

• • • • » 
2755 ATTTTCACTAACTQAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 2804 

I I I III I I I I III III 

3626 GAAGTGAGGACGTCACGCAGACAGCACTGTCAACACTGGGCCTTAGGAGA 3675 

• • • * • 
2805 AAAAA 2809 

3676 CCCCGTTGCCACGGGGGGCTGCTGAGGGAACACCAGTGCTCTGTCAGCAG 3725 
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CLUSTAL W (1.74) multiple sequence alignment 

humanVR2 mtspssspvfrletldggqedgseadrgkldfgsglppmesqfqgedrkfapqirvnlny 
ratVR2 - 

humanVR2 rkgtgasqpdpnrfdrdrlfnavsrgvpedlaglpeylsktskyltdseytegstgktcl 
ratVR2 - - 

humanVR2 mkavlnlkdgvnacilpllqidrdsgnpqplvnaqctddyyrghsalhiaiekrslqcvk 
ratVR2 -- - - 

humanVR2 llvenganvharacgrffqkgqgtcfyfgelplslaactkqwdwsyllenphqpaslqa 

rat VR2 sthasalslaactkqwdwtyllenphqpaslea 

*************.************.* 

* • • 

h u m an VR2 tdsqgntvlhalvmisdnsaenialvtsmydgllqagarlcptvqledirnlqdltplkl 
rat VR2 tdslgntvlhalvmiadnspensalvihmydgllqmgarlcptvqleeisnhqgltplkl 

*** ***********.*** ** *** ******* ************* * * ****** 

• • • • 

h u man VR2 aakegkieifrhilqrefsg- lshlsrkftewcygpvrvslydlasvdsceensvlei ia 
rat VR2 aakegkieifrhilqrefsgpyqplsrkftewcygpvrvslydlssvdsweknsvleiia 

******************** ************************* *.******** 

• • « 

humanVR2 fhcksphrhriwvleplnkllqakwdllipkfflnflcnliymfiftavayhqptlkkqa 
rat VR2 fhckspnrhriwvlepujkllqekitorlvsrfffotacylvymfiftvvayhqpsldqpa 

******.*************** *** *. .**.** * *.****** ******.* . * 



humanVR2 aphijcaevgnsmlltghilillggiyllvgqlwyfwrrhvfiwisfidsyfeilflfqal 
rat VR2 ipsskatfgesmlllghilillggiylllgqlwyfwrrrlfiwisfmdsyfeilfllqal 

* ** *.**** *************.*********..******.*********.*** 



humanVR2 ltwsqvlcflaiewylpllvsalvlgwlnllyytrgfqhigiysvmiqkvilrdllrfl 
rat VR2 ltvlsqvlrfmetewylpllvlslvlgwlhllyytrgfqhtgiysvmiqkvilrdllrfl 

******** *. ******** ************************************** 

• • • 

humanVR2 liylvflfgfavalvslsqeawrpeaptgpnatesvqpmegqedegngaqyrgileasle 
rat VR2 lvylvflfgfavalvslsrearspkapednnstvteqptvgqeeep— apyrsildasle 

*.****************.** *.** *.* . ** ***.* * ** **.**** 

humanVR2 lfkftigmgelafqeqlhfrgmvlllllayvlltyilllnmlialmsetvnsvatdswsi 
rat VR2 lfkftigmgelafqeqlrfrgwlllllayvlltyvlllnmlialmsetvnhvadnswsi 

*****************.***.*************.*************** ** .**** 

• • • • 

humanVR2 wklqi^isvlemengywwcr-kkqragvmltvgtkpdgspderwcfrveevnwasweqtl 
rat VR2 wklqkaisvlemengywwcrrkkhregrllkvgtrgdgtpderwcfrveevnkaawektl 

******************** **.* * .* ***. **.***************.**.** 

• • • • • • • 

humanVR2 ptlcedpsgagvprtlenpvlasppkededgaseenyvpvqllqsn 

rat VR2 PTLSEDPSGPGITGNKKNPTSK-PGK NSASEEDHLPLQVLQSP 

*** ***** *. .** * * . ★***...*.*.*** 

• •••••• • • ••••• 
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GAP of: ratvr2.pep check: 9190 from: 1 to: 554 
ratVR2 Flrxbl47gll 

to: humanvr2.pep check: 5746 from: 1 to: 764 
humanVR2 Flh21ell 

Symbol comparison table: /usr/local/gog_9.1/gcgcore/data/rundata/blosum62.cmp 
CompCheck: 6430 



Gap Weight: 


12 


Average Match: 


2.912 


Length Weight: 


4 


Average Mismatch: 


-2.003 


Quality: 


2182 


Length: 


766 


Ratio: 


3.939 


Gaps: 


4 


Percent Similarity: 


81.703 


Percent Identity: 


79.167 



Match display thresholds for the alignment (s) : 
| = IDENTITY 
: = 2 
. = 1 



ratvr2.pep x humanvr2.pep 



1 




44 


201 


IMIIMI Ml MM 1 IMIIIMMMIM MINI 

GQGTCFYFGELPLSLAACTKQWDWSYLLENPHQPASLQATDSQGNTVLH 


250 


45 


• • • * • 

ALVMIADNSPENSALVIHMYDGLLQMGARLCPTVQLEEISNHQGLTPLKL 

Mill-Ill II III lllllll IIMIIIMIM 1 I MIMI 


94 


251 


ALVMISDNSAENIALVTSMYDGLLQAGARLCPTVQLEDIRNLQDLTPLKL 


300 


95 


• * * • • 
AAKEGKIEIFRHILQREFSGPYQPLSRKFTEWCYGPVRVSLYDLSSVDSW 

MINIMI lllllllllll M IIMIIIIMMMMIMIIII 


144 


301 


AAKEGKIEIFRHILQREFSG . LSHLSRKFTEWCYGPVRVSLYDLASVDSC 


349 


145 


• • • • • 

EKNSVLEIIAFHCKSPNRHRMWLEPLNKLLQEKWDRLVSRPFFNFACYL 

MIIIIMM Mill MIMI MM Ml II II III M Ml II 1 I 

EENSVLEIIAFHCKSPHRHRMVVLEPLNKLLQAKWDLLIPKFFLNFLCNL 


194 


350 


399 


195 


• • • • • 

VYMFIFTWAYHQPSLDQPAIPSSKATFGESMLLLGHILILLGGIYLLLG 


244 


400 


MIMM llllll-l .Mill MM IMMIMMIMM 

IYMFIFTAVAYHQPTLKKQAAPHLKAEVGNSMLLTGHILILLGGIYLLVG 


449 
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• • • • • 

245 QLWYFWRRRLF IWI S FMDS YFEILFLLQALLTVL SQVLRFMETEWYLPLL 294 

MINIM MIMMMMIMM MMMMMI h MMMI 

450 QLWYFWRRHVFIWISFIDSYFEILFLFQALLTWSQVLCFLAIEWYLPLL 499 

• • • • • 

295 VLSLVLGWLNLLYYTRGFQHTGIYSVMIQKVILRDLLRFLLVYLVFLFGF 344 

I •llllllllillllllllllllllllllllllllllllhllllllll 

500 VSALVLGWLNLLYYTRGFQHTGI YSVMI QKVILRDLLRFLLI YLVFLFGF 549 

• • • • • 

345 AVALVSLSREARSPKAPEDNNSTVTEQPTVGQEEE. . PAPYRSILDASLE 392 

MMMMMI IMI IM - II IMM I II MMMI 

550 AVALVSLSQEAWRPEAPTGPNATESVQPMEGQEDEGNGAQYRGILEASLE 599 

• * • * • 

393 LFKFTIGMGELAFQEQLRFRGVVLLLLLAYVLLTYVLLLl^LIALMSETV 442 

1 1 1 ! 1 1 1 M 1 1 1 M I M MIMMMMMMMMMMMMIMM 

600 LFKFTIGMGELAFQEQLHFRGMVLLLLLAYVLLTYILLLNMLIALMSETV 649 

• • • • • 

443 NHVADNSWS IWKLQKAI S VLEMENGYWWCRRKKHREGRLLKVGTRGDGTP 492 

I II MIIIIIIMMIII MIMIMI III I I M llh MM 

650 NSVATDSWS IWKLQKAI SVLEMENGYWWC . RKKQRAGVMLTVGTKPDGSP 698 

• ♦ • • • 

493 DERWCFRVEEVNWAAWEKTLPTLSEDPSGPGITGNKKNPT . • • . SKPGKN 538 

Mill IMIIIIINMMII M Mill M Ml I •• 

699 DERWCFRVEEVNWASWEQTLPTLCEDPSGAGVPRTLENPVLASPPKEDED 748 

539 SASEEDHLPLQVLQSP 554 

lll|.:.|.|.|ll 
749 GASEENYVPVQLLQSN 764 
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GAP of: humanvrl.seq check: 4554 from: 1 to: 3909 
humanVRl Fbhl8547pat - Import - complete 

to: ratvrl .seq check: 7921 from: 1 to: 2847 
ratVRl.seq AF029310 in GenBank 

Symbol comparison table: 

/ddm_local/gcg/gcg_9 . 1/gcgcore/data/rundata/nwsgapdna. cmp 
CompCheck: 8760 

Gap Weight: 50 Average Match: 10.000 

Length Weight: 3 Average Mismatch: 0.000 

Quality: 22717 Length: 3914 

Ratio: 7.979 Gaps: 10 

Percent Similarity: 82.125 Percent Identity: 82.125 

Match display thresholds for the alignment (s) : 
| = IDENTITY 
: = 5 
. = 1 

humanvrl . seq x ratvrl . seq 

• * * • . 
1001 CCAGGCC6TAGATGCTCCCCGCCGGTCAGTCACTTAGTCGTCAGATCGCC 1050 

III III 
1 CAGCTCCAAGGCACTTGCTCC 21 

• • . . • 

1051 CGTCCTGGTATCACAGTGCTTCTGTTCAGGTTGCACACTGGGCCACAGAG 1100 

I IN I I I I I I lilllll I llllllllllll 

22 ATTTGGGGTGTGCCTGCACCT . . . AGCTGGTTGCAAATTGGGCC ACAGAG 68 

. • • . • 

1101 GATCCAGCAAGGATGAAGAAATGGAGCAGCACAGACTTGGGGACAGCTGC 1150 

MM I lilllll I II II III lllll I I III 

69 GATCTGGAAAGGATGGAACAACGGGCTAGCTTAGACTCAGAGGAGTCTGA 118 

• • * • * 

1151 GGACCCACTCCAAAAGGACACCTGCCCAGACCCCCTGGATGGAGACCCTA 1200 

i iiiii mi it ii mill mil i ii iiiiiiiii 

119 GTCCCCACCCCAAGAGAACTCCTGCCTGGACCCTCCAGACAGAGACCCTA 168 
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1201 


ACTCCAGGCCACCTCCAGCCAAGCCCCAGCTCCCCACGGCCAAGAGCCGC 


1250 


169 


III II lllllllllll MINIM II III III III II 

ACTGCAAGCCACCTCCAGTCAAGCCCCACATCTTCACTACCAGGAGTCGT 


218 


1251 


• • • » • 

ACCCGGCTCTTTGGGAAGGGTGACTCGGAGGAGGCTTTCCCGGTGGATTG 


1300 


219 


llllllll llllllllllllllllllllllllll 1 II Mil II 
ACCCGGCTTTTTGGGAAGGGTGACTCGGAGGAGGCCTCTCCCCTGGACTG 


268 


1301 


* • • • • 

CCCCCACGAGGAAGGTGAGTTGGACTCCTGCCCGACCATCACAGTCAGCC 


1350 


269 


iii i iiiiiiii i i iii iiiiiiii i iiiiii Mini 

CCCTTATGAGGAAGGCGGGCTGGCTTCCTGCCCTATCATCACTGTCAGCT 


318 


1351 


• • • » • 

CTGTTATCACCATCCAGAGGCCAGGAGACGGCCCCACCGGTGCCAGGCTG 


1400 


319 


inn i ii iiiMiiiiii ii ii ii ii ii iii inn i 

CTGTTCTAACTATCCAGAGGCCTGGGGATGGACCTGCCAGTGTCAGGCCG 


368 


1401 


• • • • • 

CTGTCCCAGGACTCTGTCGCCGCCAGCACCGAGAAGACCCTCAGGCTCTA 


1450 


369 


lllllllllll III MM 1 IIIIII III IIIIIIII 

TCATCCCAGGACTCCGTCTCCGCTGG. . . TGAGAAGCCCCCGAGGCTCTA 


415 


1451 


• • • • • 

TGATCGCAGGAGTATCTTTGAAGCCGTTGCTCAGAATAACTGCCAGGATC 


1500 


416 


llllllllllll Mill II II II IIIIII MMMIIMM 1 

TGATCGCAGGAGCATCTTCGATGCTGTGGCTCAGAGTAACTGCCAGGAGC 


465 


1501 


• • • • • 

TGGAGAGCCTGCTGCTCTTCCTGCAGAAGAGCAAGAAGCACCTCACAGAC 


1550 


466 


MMMIMIIMM lllllllllll lllllllllll Ml II III 

TGGAGAGCCTGCTGCCCTTCCTGCAGAGGAGCAAGAAGCGCCTGACTGAC 


515 


1551 


• • • * • 

AACGAGTTCAAAGACCCTGAGACAGGGAAGACCTGTCTGCTGAAAGCCAT 


1600 


516 


1 ! M M 1 1 1 1 1 1 1 1 M IIIIIIII llllllllllllll IIIIIIII 
AGCGAGTTCAAAGACCCAGAGACAGGAAAGACCTGTCTGCTAAAAGCCAT 


565 


1601 


• • • • • 

GCTCAACCTGCACGACGGACAGAACACCACCATCCCCCTGCTCCTGGAGA 


1650 


D00 


IIIIII IIIIII 1 II Mill IMIIII 1 lllllllllll 

bL I LAATCTGCACAATGGGCAGAATGACACCATC 


013 


1651 


• • t • • 
TCGCGCGGCAAACGGACAGCCTGAAGGAGCTTGTCAACGCCAGCTACACG 


1700 


616 


1 II III 1 II llllllllllll II IMIIII MINIMI! 

TTGCCCGGAAGACAGACAGCCTGAAGCAGTTTGTCAATGCCAGCTACACA 


665 


1701 


• • • • • 

GACAGCTACTACAAGGGCCAGACAGCACTGCACATCGCCATCGAGAGACG 


1750 


666 


1 1 1 1 1 ! 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! Mill II 1 II 

GACAGCTACTACAAGGGCCAGACAGCACTGCACATTGCCATTGAACGGCG 


715 



Fig. 9 continued 



SUBSTITUTE SHEET (RULE 26) 

BNSDOCID: <WO__OG29577A 1 JA> 



WO 00/29577 



PCT/US99/26701 



25/37 

• • • • 



1751 


CAACATGGCCCTGGTGACCCTCCTGGTGGAGAACGGAGCAGACGTCCAGG 


1800 


716 


linn i iiiniiiiiii iiiiiiini iiniiii mini 

GAACATGACGCTGGTGACCCTCTTGGTGGAGAATGGAGCAGATGTCCAGG 


765 


1801 


• • • • • 

CTGCGGCCCATGGGGACTTCTTTAAGAAAACCAAAGGGCGGCCTGGATTC 


1850 


766 


lllllll i IIIIIIINI lllllllllllllll lllllll III 

CTGCGGCTAACGGGGACTTCTTCAAGAAAACCAAAGGGAGGCCTGGCTTC 


815 


1851 


• • • • • 

TACTTCGGTGAACTGCCCCTGTCCCTGGCCGCGTGCACCAACCAGCTGGG 


1900 


816 


INN Mill lllllllllllllllll lllllllllllllllllll 

TACTTTGGTGAGCTGCCCCTGTCCCTGGCTGCGTGCACCAACCAGCTGGC 


865 


1901 


• • • • • 

CATCGTGAAGTTCCTGCTGCAGAACTCCTGGCAGACGGCCGACATCAGCG 


1950 


866 


III llllllllllllllllllllllllllllll 1 II MINIM! 

CATTGTGAAGTTCCTGCTGCAGAACTCCTGGCAGCCTGCAGACATCAGCG 


915 


1951 


• • • • • 

CCAGGGACTCGGTGGGCAACACGGTGCTGCACGCCCTGGTGGAGGTGGCC 

M lllllll lllllllllllllllll II lllllllllllllllll 
CCCGGGACTCAGTGGGCAACACGGTGCTTCATGCCCTGGTGGAGGTGGCA 


2000 


916 


965 


2001 


• • • • • 

GACAACACGGCCGACAACACGAAGTTTGTGACGAGCATGTACAATGAGAT 


2050 


966 


II lllll 1 IINIIII Mill Mill lllllllllll Mill 

GATAACACAGTTGACAACACCAAGTTCGTGACAAGCATGTACAACGAGAT 


1015 


2051 


TCTGATGCTGGGGGCCAAACTGCACCCGACGCTGAAGCTGGAGGAGCTCA 


2100 


1016 


MM MMMIMIMM lllll llllllllllllll III III 

CTTGATCCTGGGGGCCAAACTCCACCCCACGCTGAAGCTGGAAGAGATCA 


1065 


2101 


• • • • • 

CCAACAAGAAGGGAATGACGCCGCTGGCTCTGGCAGCTGGGACCGGGAAG 


2150 


1066 


mm mill i mil imimiimi m i i mm 

CCAACAGGAAGGGGCTCACGCCACTGGCTCTGGCTGCTAGCAGTGGGAAG 


1115 


2151 


• • • • • 

ATCGGGGTCTTGGCCTATATTCTCCAGCGGGAGATCCAGGAGCCCGAGTG 


2200 




IIIIMIIIIIIIIIII lllllllll llllllllll II llllllll 


HDD 


2201 


• • • • • 

CAGGCACCTGTCCAGGAAGTTCACCGAGTGGGCCTACGGGCCCGTGCACT 


2250 


1166 


1 1 lllll IIIIIIIIIIIIIIIM llllllll lllll lllllll 
CCGACACCTATCCAGGAAGTTCACCGAATGGGCCTATGGGCCAGTGCACT 


1215 


2251 


• • • • » 

CCTCGCTGTACGACCTGTCCTGCATCGACACCTGCGAGAAGAACTCGGTG 


2300 


1216 


MM II II llllllllllllll IMIIMI II lllllllllll 

CCTCCCTTTATGACCTGTCCTGCATTGACACCTGTGAAAAGAACTCGGTT 


1265 
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2301 


» • • « • 

CTGGAGGTGATCGCCTACAGCAGCAGCGAGACCCCTAATCGCCACGACAT 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 


2350 


1266 


1 II 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 
CTGGAGGTGATCGCTTACAGCAGCAGTGAGACCCCTAACCGTCATGACAT 


1315 


2351 


• • • • • 

GCTCTTGGTGGAGCCGCTGAACCGACTCCTGCAGGACAAGTGGGACAGAT 
III 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


2400 


1316 


III 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
GCTTCTCGTGGAACCCTTGAACCGACTCCTACAGGACAAGTGGGACAGAT 


1365 


2401 


TC6TCAAGCGCATCTTCTACTTCAACTTCCTGGTCTACTGCCTGTACATG 
1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III 


2450 


1366 


1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III 
TTGTCAAGCGCATCTTCTACTTCAACTTCTTCGTCTACTGCTTGTATATG 


1415 


2451 


• • • • ■ 

ATCATCTTCACCATGGCTGCCTACTACAGGCCCGTGGATGGCTTGCCTCC 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 


2500 


1416 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 INI 1 1 1 1 1 1 1 1 1 1 1 1 1 II 
ATCATCTTCACCGCGGCTGCCTACTATCGGCCTGTGGAAGGCTTGCCCCC 


1465 


2501 


* • • • • 

CTTTAAGATGGAAAAAA. . . TTGGAGACTATTTCCGAGTTACTGGAGAGA 
II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 


2547 


1466 


II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 
CTATAAGCTGAAAAACACCGTTGGGGACTATTTCCGAGTCACCGGAGAGA 


1515 


2548 


• • • • • 

TCCTGTCTGTGTTAGGAGGAGTCTACTTCTTTTTCCGAGG6ATTCAGTAT 
II 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III 


2597 


1516 


II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III 
TCTTGTCTGTGTCAGGAGGAGTCTACTTCTTCTTCCGAGGGATTCAATAT 


1565 


2598 


• • • • • 

TTCCTGCAGAGGCGGCCGTCGATGAAGACCCTGTTTGTGGACAGCTACAG 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


2647 


1566 


1 1 II II II 1 II 1 II II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
TTCCTGCAGAGGCGACCATCCCTCAAGAGTTTGTTTGTGGACAGCTACAG 


1615 


2648 


■ • • • • 

TGAGATGCTTTTCTTTCTGCAGTCACTGTTCATGCTGGCCACCGTGGTGC 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


2697 


1616 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
TGAGATACTTTTCTTTGTACAGTCGCTGTTCATGCTGGTGTCTGTGGTAC 


1665 


2698 


• • • • • 

TGTACTTCAGCCACCTCAAGGAGTATGTGGCTTCCATGGTATTCTCCCTG 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


2747 


1666 


M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
TGTACTTCAGCCAACGCAAGGAGTATGTGGCTTCCATGGTGTTCTCCCTG 


1715 


2748 


• • • • • 

GCCTTGGGCTGGACCAACATGCTCTACTACACCCGCGGTTTCCAGCAGAT 


2797 


1716 


III lllllllllllllllllllllllll Mill II lllllllllll 

GCCATGGGCTGGACCAACATGCTCTACTATACCCGAGGATTCCAGCAGAT 


1765 


2798 


• • • • » 

GGGCATCTATGCCGTCATGATAGAGAAGATGATCCTGAGAGACCTGTGCC 


2847 


1766 


lllllllllll! Illlllll IIMIIMIIIIII lllllllllllll 

GGGCATCTATGCTGTCATGATTGAGAAGATGATCCTCAGAGACCTGTGCC 


1815 
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• • • • • 

2848 GTTTCATGTTTGTCTACATCGTCTTCTTGTTCGGGTTTTCCACAGCGGTG 2897 

i ii inn Mini mi iiiinii n iiiiiiimi in 

1816 GGTTTATGTTCGTCTACCTCGTGTTCTTGTTTGGATTTTCCACAGCTGTG 1865 

• • • • * 
2898 GTGACGCTGATTGAAGACGGGAAGAATGACTCCCTGCCGTCTGAGTCCAC 2947 

1oec inn minim ii iiiiiiiii mi inn iimiii 

1866 GTGACACTGATTGAGGATGGGAAGAATAACTCTCTGCCTATGGAGTCCAC 1915 

• • • • ♦ 

2948 GTCGCACAGGTGGCGGGGGCCTGCCTGCAGGCCCCCCGATAGCTCCTACA 2997 

1Q1C i mi in imn iiiiiiiii i n i n m mi 

1916 ACCACACAAGTGCCGGGGGTCTGCCTGCAAG. . . CCAGGTAACTCTTACA 1962 

• ♦ • • • 

2998 ACAGCCTGTACTCCACCTGCCTGGAGCTGTTCAAGTTCACCATCGGCATG 3047 

10 „ Minimi inn n mnimiimiimimmim 

1963 ACAGCCTGTATTCCACATGTCTGGAGCTGTTCAAGTTCACCATCGGCATG 2012 

• • • • • 
3048 GGCGACCTGGAGTTCACTGAGAACTATGACTTCAAGGCTGTCTTCATCAT 3097 

iiiiiiiimiiiiiiniiiiiii iiiiimiiiiiiiiiimii 

2013 GGCGACCTGGAGTTCACTGAGAACTACGACTTCAAGGCTGTCTTCATCAT 2062 

• • ♦ . • 
3098 CCTGCTGCTGGCCTATGTAATTCTCACCTACATCCTCCTGCTCAACATGC 3147 

, n „ mi i mimim iiiimmiiiim immiimi 

2063 CCTGTTACTGGCCTATGTGATTCTCACCTACATCCTTCTGCTCAACATGC 2112 
3148 TCATCGCCCTCATGGGTGAGACTGTCAACAAGATCGCACAGGAGAGCAAG 3197 

mi n iinimmm miiiimi inn iiiiiiiii 

2113 TCATTGCTCTCATGGGTGAGACCGTCAACAAGATTGCACAAGAGAGCAAG 2162 

• • . . 

3198 AACATCTGGAAGCTGCAGAGAGCCATCACCATCCTGGACACGGAGAAGAG 3247 

llimilllllllllllllllllllllllllllllll II MIMIM 

2163 AACATCTGGAAGCTGCAGAGAGCCATCACCATCCTGGATACAGAGAAGAG 2212 

• • • • * 

3248 CTTCCTTAAGTGCATGAGGAAGGCCTTCCGCTCAGGCAAGCTGCTGCAGG 3297 

,,,111111 llllllllllllllllllllllllll lllllllliliiiin 

2213 CTTCCTGAAGTGCATGAGGAAGGCCTTCCGCTCTGGCAAGCTGCTGCAGG 2262 

• • • • . 
3298 TGGGGTACACACCTGATGGCAAGGACGACTACCGGTGGTGCTTCAGGGTG 3347 

„„ linn in inn iimin miiiiiiimi iiiiiiiii 

2263 TGGGGTTCACTCCTGACGGCAAGGATGACTACCGGTGGTGTTTCAGGGTG 2312 

• • • • « 
3348 GACGAGGTGAACTGGACCACCTGGAACACCAACGTGGGCATCATCAACGA 3397 

„„ minim iimin mimiiiiiii urn minimi 

2313 GACGAGGTAAACTGGACTACCTGGAACACCAATGTGGGTATCATCAACGA 2362 

Fig. 9 continued 
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• • • 



3398 


AGACCCGGGCAACTGTGAGGGCGTCAAGCGCACCCTGAGCTTCTCCCTGC 


3447 


2363 


Mill llllllllllllllllllllllllllllllllllllllllll 

GGACCCAGGCAACTGTGAGGGCGTCAAGCGCACCCTGAGCTTCTCCCTGA 


2412 


3448 


• • • ♦ • 

GGTCAAGCAGAGTTTCAGGCAGACACTGGAAGAACTTTGCCCTGGTCCCC 

Mill II llllllllll III llllllllllllllllllllll III 

GGTCAGGCCGAGTTTCAGGGAGAAACTGGAAGAACTTTGCCCTGGTTCCC 


3497 


2413 


2462 


3498 


• • • ♦ • 

CTTTTAAGAGAGGCAAGTGCTCGAGATAGGCAGTCTGCTCAGCCCGAGGA 


3547 


2463 


III 1 II II Mill llllllllll II 1 1 MM II II 

CTTCTGAGGGATGCAAGCACTCGAGATAGACATGCCACCCAGCAGGAAGA 


2512 


3548 


• • • • • 

AGTTTATCTGCGACAGTTTTCAGGGTCTCTGAAGCCAGAGGACGCTGAGG 


3597 


2513 


MM 1 III II 1 1 1 II II II lllllllllll lllllll 

AGTTCAACTGAAGCATTATACGGGATCCCTTAAGCCAGAGGATGCTGAGG 


2562 


3598 


• • • • • 

TCTTCAAGAGTCCTGCCGCTTCCGGGGAGAAGTGA ♦ GGACGTCACGCAGA 


3646 


2563 


1 llllll II II llllllll 1 1 MM 1 Mil 

TTTTCAAGGATTCCATGGTCCCAGGGGAGAAATAATGGACACTATGCAGG 


2612 


3647 


• • • • • 

CAGCACTGTCAACACTGGGCCTTAGGAGACCCCGTTGCCACGGGGGGCTG 


3696 


2613 


1 II II lllllll III 

GATCAATG CGGGGTCTTTGGGTGGTCTG 


2640 


3697 


• • • * • 
CTGAGGGAACACCAGTGCTCTGTCAGCAGCCTGGCCTGGTCTGTGCCTGC 


3746 


2641 


II lllllll 1 1 II 1 1 1 II 1 lllllllllll 
CTTAGGGAAC . CAGCAGGGTTGACGTTATCTGGGTCCACTCTGTGCCTGC 


2689 


3747 


• • • # • 

CCA . GCATGTTCCCAAATCTGTGCTGGACAAGCTGTGGGAAGCGTTCTTG 


3795 


2690 


1 1 III MM 1 llllll llllllllll 1 1 

CTAGGCACATTCCTAGGACTTCGGCGGGCCTGCTGTGGGAA . CTGGGAGG 


2738 


3796 


♦ • • • • 

GAAGCATGGGGAGTGATGTACATCCAACCGTCACTGTCCCCAAGTGAATC 

1 1 1 Mill llllllll 1 1 II MM 1 

TGTGTGGGAATTGAGATGTGTATCCAACCATGA. . . TCTCCAAACATTTG 


3845 


2739 


2785 


3846 


• • • • • 

TCCTAACAGACTTTCAGGTTTTTACTCACTTTACTAAAAAAAAAAAAAAA 


3895 


2786 


1 1 1 MM II MM II 1 1 II III 

GCTTTCAACTCTTTATGGACTTTATTAAACAGAGTGAATGGCAAATCTCT 


2835 


3896 


AGGGCGGCCGCTTA 3909 




2836 


1 III 
ACTTGGACACAT . . 2847 
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GAP of: humanvrl.pep check: 6877 from: 1 to: 839 
humanVRl _Fbhl8547pat - fchrb87a6 / 3909 bases, 4554 checksum, 
to: ratvrl.pep check: 5764 from: 1 to: 838 

ratVRl | AF029310 Rattus norvegicus vanilloid receptor subtype 1 mRNA, 
complete 
cds. 

Symbol comparison table: 

/ddm_local/gcg/gcg_9 ♦ l/gcgcore/data/rundata/blosum62 .cmp 
CompCheck: 6430 

Gap Weight: 12 Average Match: 2.912 

Length Weight: 4 Average Mismatch: -2.003 

Quality: 3734 Length: 840 

Ratio: 4.456 Gaps: 3 



Percent Similarity: 89.247 Percent Identity: 86.022 



Match display thresholds for the alignment (s) : 
| = IDENTITY 
: = 2 
. = 1 

humanvrl.pep x ratvrl.pep 

..... 

1 MKKWSSTDLGTAADPLQKDTCPDPLDGDPNSRPPPAKPQLPTAKSRTRLF 50 

-I I • I ...|.||.|.|M.:|| II : I :||llll 
1 MEQRASLDSEESESPPQENSCLDPPDRDPNCKPPPVKPHIFTTRSRTRLF 50 

• • • » • 

51 GKGDSEEAFPVDCPHEEGELDSCPTITVSPVITIQRPGDGPTGARLLSQD 100 

Illlllll Mlhlll I III MM hlllllllll I III 

51 GKGDSEEASPLDCPYEEGGLASCPIITVSSVLTIQRPGDGPASVRPSSQD 100 

..... 

101 SVAASTEKTLRLYDRRSIFEAVAQNNCQDLESLLLFLQKSKKHLTDNEFK 150 

Ihl II IIIIIMIhlllhllhlllll Mhlll llhlll 

101 SVS AG . EKPPRL YDRRS IFDAVAQSNCQELESLLPFLQRSKKRLTDSEFK 149 

• . . • . 

151 DPETGKTCLLKAMLNLHDGQNTTIPLLLEIARQTDSLKELVNASYTDSYY 200 

llllllllllllllllhlll II llhMhllllh llllllllll 

150 DPETGKTCLLKAMLNLHNGQNDTIALLLDVARKTDSLKQFVNASYTDSYY 199 

• * . . . 

201 KGQTALHIAIERRNMALVTLLVENGADVQAAAHGDFFKKTKGRPGFYFGE 250 

IMIIIIIIIIIIII 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 i . i 1 1 1 1 1 E 1 1 1 1 1 1 1 1 1 ! 

200 KGQTALHIAIERRNMTLVTLLVENGADVQAAANGDFFKKTKGRPGFYFGE 249 

Fig. 10 
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• # • * 



251 


LPLSLAACTNQLGIVKFLLQNSWQTADISARDSVGNTVLHALVEVADNTA 

llllllllllll lllllllllll MINIM! MINI MINI Ml 


300 


250 


1 1 1 1 1 II 1 1 1 1 i lllllllllll I I I 1 I 1 I I 1 I I I I I I I I I I I I I I I 
LPLSLAACTNQLAIVKFLLQNSWQPADISARDSVGNTVLHALVEVADNTV 


299 


301 


DNTKFVTSMYNEILMLGAKLHPTLKLEELTNKKGMTPLALAAGTGKIGVL 

II IMMMIII IMI 1 IMIIMIM MIMIMMIII II -111111 


350 


300 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 • 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 • 1 1 1 1 M 

DNTKFVTSIIYIIEILILGAKLHPTLKLEEITNRKGLTPLALAASSGKIGVL 


349 


351 


AYILQREIQEPECRHLSRKFTEWAYGPVHSSLYDLSCIDTCEKNSVLEVI 

IMIIMI 1 M 1 1 !! 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 M 1 II 1! 1 M 1 1 1 M 1 1 1 


400 


350 


AYILQREIHEPECRHLSRKFTEWAYGPVHSSLYDLSCIDTCEKNSVLEVI 


399 


401 


AYSSSETPNRHDMLLVEPLNRLLQDKWDRFVKRIFYFNFLVYCLYMIIFT 

MMIMMMIMMMMMIMMIMMMIMM III MUM 


450 


400 


AYSSSETPNRHDMLLVEPLNRLLQDKWDRFVKRIFYFNFFVYCLYMIIFT 


449 


451 


MAAYYRPVDGLPPFKMEK . IGDYFRVTGEILSVLGGVYFFFRGIQYFLQR 

II llllhl lll:|:. HMIINIINN 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 
I I I I i l l * ll I l ' l * * * I I I I I i I 1 1 I M i i i i i i i i i i i i i i i i i 

AAAYYRPVEGLPPYKLKNTVGDYFRVTGEILSVSGGVYFFFRGIQYFLQR 


499 


450 


499 


500 


RPSHKTLFVDSYSEMLFFLQSLFMLATWLYFSHLKEYVASMVFSLALGW 

MMMIMIIMMMMMIIM .111111 lllllllllllhll 

RPSLKSLFVDSYSEILFFVQSLFMLVSWLYFSQRKEYVASMVFSLAMGW 


549 


500 


549 


550 


• • • • * 

TNMLYYTRGFQQMGIYAVMIEKMILRDLCRFMFVYIVFLFGFSTAWTLI 

llllllllllllllllllllllllllllllllllhllllllllllllll 


599 


550 


TNMLYYTRGFQQMGIYAVMIEKMILRDLCRFMFVYLVFLFGFSTAWTLI 


599 


600 


EDGKNDSLPSESTSHRWRGPACRPPDSSYNSLYSTCLELFKFTIGMGDLE 

1 1 1 1 1. 1 1 1 III 1 : II II: 1 . 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
11111*111 III I ' II I I • I * I I I I I I I I I I l I l l 1 1 l l ll l l I 

EDGKNNSLPMESTPHKCRGSACK . PGNSYNSLYSTCLELFKFTIGMGDLE 


649 


600 


648 


650 


FTENYDFKAVFIILLLAYVILTYILLLNMLIALMGETVNKIAQESKNIWK 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 
1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 

FTENYDFKAVFIILLLAYVILTYILLLNMLIALMGETVNKIAQESKNIWK 


699 


649 


698 


700 


LQRAITILDTEKSFLKCMRKAFRSGKLLQVGYTPDGKDDYRWCFRVDEVN 

MINI IMMIIM II MIMI lllllll MINIM Ml IMMIMI 


749 


699 


1 II If II II 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 M 

LQRAITILDTEKSFLKCMRKAFRSGKLLQVGFTPDGKDDYRWCFRVDEVN 


748 


750 


• • • • • 

WTTWNTNVGIINEDPGNCEGVKRTLSFSLRSSRVSGRHWKNFALVPLLRE 


799 


749 


lllllllllllllllllllllllllllllll 1 M 1 1 - M 1 i 1 1 1 II M : 
WTTWNTNVGIINEDPGNCEGVKRTLSFSLRSGRVSGRNWKNFALVPLLRD 


798 


800 


• • * • 

ASARDRQSAQPEEVYLRQFSGSLKPEDAEVFKSPAASGEK 839 
II III • 1 III N -.111111111111 III 




799 


ASTRDRHATQQEEVQLKHYTGSLKPEDAEVFKDSMVPGEK 838 
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CLUSTAL W (1.74) multiple sequence alignment 

humanVR2.alt 

human VR2 mtspssspvfrletldggqedgseadrgkldfgsglppmesqfqgedrkfapqirvnlny 

humanVR2.alt - - 

human VR2 rkgtgasqpdpnrfdrdrlfnavsrgvpedlaglpeylsktskyltdseytegstgktcl 

humanVR2.alt --- - — 

human VR2 mkavlnlkdgvnacilpllqidrdsgnpqplvnaqctddyyrghsalhiaiekrslqcvk 

humanVR2.alt grffqkgqgtcfyfgelplslaactkqwdwsyllenphqpaslqa 

human VR2 llvenganvharacgrffqkgqgtcfyfgelplslaactkqwdwsyllenphqpaslqa 

********************************************** 

humanVR2.alt tdsqgntvlhalvmisdnsaenialvtsmydgllqagarlcptvqledirnlqdltplkl 
human VR2 tdsqgntvlhalvmisdnsaenialvtsmydgllqagarlcptvqledirnlqdltplkl 

************************************************************ 

humanVR2.alt AAKEGKIEIFRHILQREFSGLSHLSRKFTEWCYGPVRVSLYDLASVDSCEENSVLEIIAF 

human VR2 AAKEGKIEIFRHILQREFSGLSHLSRKFTEWCYGPVRVSLYDLASVDSCEENSVLEIIAF 
************************************************************ 

humanVR2.alt HCKS PHRHRMVVLEPLNKLLQAKWDLLIPKFFDNFLCNLI YMFI FTAVAYHQPTLKKQAA 

human VR2 HCKS PHRHRMVVLEPLNKLLQAKWDLLIPKFFLNFLCNLI YMF IFTAVAYHQPTLKKQAA 
************************************************************ 

hu ma n VR2 .alt PHLKAEVGNSMLLTGHILILLGGI YLLVGQLWYFWRRHVFIWI sfidsyfeilflfqall 
human VR2 phlkaevgnsmlltghylillggiyllvgqlwyfwrrhvfiwisfidsyfeilflfqall 

************************************************************ 

humanVR2.alt tvvsqvlcflaiewylpllvsalvlgwlnllyytrgfqhtgiysvmiq 

human VR2 twsqvlcfiaiewylpllvsalvlgwlnllyytrgfqhtgiysvmiqkvilrdllrfll 
************************************************ 

humanVR2.alt - - - 

human VR2 iylvflfgfavalvslsqeawrpeaptgpnatesvqpmegqedegngaqyrgileaslel 

humanVR2.alt - 

human VR2 fkftigmgeiafqeqlhfrgmvlllllayvlltyilllnmlialmsetvnsvatdswsiw 

humanVR2.alt --kkaisvlfjsiengywwcrkkqragvmltvgtkpdgspderwcfrveevnwasweqtlpt 

human VR2 klqkaisvlemengywwcrkkqragvmltvgtkpdgspderwcfrveevnwasweqtlpt 

.********************************************************* 

humanVR2.alt lcedpsgagvprtlenpvlasppkededgaseenyvpvqllqsn 

human VR2 lcedpsgagvprtlenpvlasppkededgaseenyvpvqllqsn 
******************************************** 
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Protein Family / Domain Matches, HAAMer version 2 

Searching for complete domains 

hmmpfam - search a single seq against HMM database 

HMMER 2.1.1 (Dec 1998) 

Copyright (C) 1992-1998 Washington University School of Medicine 
HMMER is freely distributed under the GNU General Public License (GPL) . 



HMM file: /prod/ddm/seqanal/PFAM/pfam4.2/Pfam 

Sequence file: /usr/ns-home/docs/seqanal/orfanal/oa-script 18670 seq 



Query: hVR-1 

Scores for sequence family classification (score includes all domains): 
Model Description Score E-value N 



ank Ank repeat 51.5 1.9e-ll 3 

Parsed for domains: 



Model 


Domain 


seq-f 


seq-t 


hmro-f 


hnnn-t 




score 


E-value 


ank 


1/3 


201 


233 .. 


1 


33 


[] 


34.4 


2.6e-06 


ank 


2/3 


248 


283 .. 


1 


33 


[] 


13.2 


2 


ank 


3/3 


333 


361 .. 


1 


33 


[] 


3.4 


26 



Alignments of top-scoring domains: 

ank: domain 1 of 3, from 201 to 233: score 34.4, E = 2.6e-06 

* - >nGnTPLHlAarygnve wklLLehGAdvnartk< - * 
+G+T+LH+A + n+ +v lL+e+GAdv a+ 



hVR-1 201 KGQTALHIAIERRNMALVTLLVENGADVQAAAH 233 

ank: domain 2 of 3, from 248 to 283: score 13.2, E = 2 

*->nGnTPLHlAarygnvewklLLe. . .hGAdvnartk<-* 
G PL lAa ++++ +vk+LL+++ + Ad+ ar+ 
hVR-1 248 FGELPLSLAACTNQLGIVKFLLQnswQTADISARDS 283 

ank: domain 3 of 3, from 333 to 361: score 3.4, E = 26 

* - >nGnTPLHlAarygnvewklLLehGAdvnartk< - * 
+G TPL lAa +g++ v ++ L+ ++ 
hVR-1 333 KGMTPLALAAGTGKIGVLAYILQ REIQEP 361 

Fig. 13 
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Protein Family / Domain Matches, HMMer version 2 

Searching for complete domains 

hmmpfam - search a single seg against HMM database 

HMMER 2.1.1 (Dec 1998) 

Copyright (C) 1992-1998 Washington University School of Medicine 
HMMER is freely distributed under the GNU General Public License (GPL) . 

HMM file: /prod/ddm/seqanal/PFAM/pfam4.2/Pfam 
Sequence file: /tmp/orfanal.5/g.aa 

Query: Flh21ell 

Scores for sequence family classification (score includes all domains): 
Model Description Score E-value 



N 



ank PF00023 Ank repeat 53.7 4e-12 

Parsed for domains: 

Model Domain seq-f seq-t hmm-f hmm-t score E-value 



ank 1/3 162 194 . . 1 33 [] 38.3 1.7e-07 

ank 2/3 208 243 .. 1 33 [] 6.4 4.3 

ank 3/3 293 328 .. 1 33 [] 8.8 2.1 

Alignments of top-scoring domains: 

ank: domain 1 of 3, from 162 to 194: score 38.3, E = 1.7e-07 

* - >nGnTPLHlAarygnve wklLLehGAdvnartk< - * 
+G+++LH+A ++ ++++vklL+e+GA+v+ar 
Flh21ell 162 RGHSALHIAIEKRSLQCVKLLVENGANVHARAC 194 

ank: domain 2 of 3, from 208 to 243: score 6.4, E = 4.3 

*->nGnTPLHlAarygnvawklLLe. . .hGAdvnartk<-* 
G PL lAa + +++w +LLe++++ A+ a++ 
Flh21ell 208 FGELPLSLAACTKQWDWSYGLEnphQPASLQATDS 243 

ank: domain 3 of 3, from 293 to 328: score 8.8, E = 2.1 

*->nGnTPLHLAarygnvewklLLe. . .hGAdvnart«-* 
+ +TPL lAa++g++e+ + L+++ G + +r 
Flh21ell 293 QDLTPLKLAAKEQKLEIFRHILQrafSGLSHLSRK 328 



Fig. 15 
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CLUSTAL W (1.74) multiple sequence alignment 

h\/2o an iYS 2 MTSPSSSPVFRLETLDGGQEDGSEADR6KLDFGSGLPPMESQFQGEDRKFAPQIRVNLNY 
nVKZ.althL MTSPSSSPVFRLETLDGGQEDGSEADRGKLDFGSGLPPMESQFQGEDRKFAPQIRVNLNy 
************************************************************ 

humanVR2 rkgtgasqpdpnrfdrdrlfnavsrgvpedlaglpeylsktskyltdseytegstgktcl 

hVR2.altFL RKGTGASQPDPNRFDRDRLFNAVSRGVPEDLAGLPEYLSKTSKyLTDSEYTEGSTGKTCL 
************************************************************ 

humanVR2 mkavlnlkdgvnacilpllqidrdsgnpqplvnaqctddyyrghsalhiaiekrslqcvk 
hVR2.altFL mkavlnlkdgvnacilpllqidrdsgnpqplvnaqctddyyrghsalhiaiekrslqcvk 

************************************************************ 

Dv U /!^ an y R2 llvenganvharacgrffqkgqgtcfyfgelplslaactkqwdwsyllenphqpaslqa 
hVR2.altFL llvenganvharacgrffqkgqgtcfyfgelplslaactkqwdwsyllenphqpaslqa 

************************************************** ********** 

h u / m anVR2 TDSQGM!VLHALVMISDNSAENIALVTSMYDGLLQAGARLCPTVQIjEDIRNLQDLTPLKL 

hVR2.altFL TDSQGNTVLHALVMISDNSAENIALWSMYDGLLQAGARLCPTVQLEDIRNLQDLTPLKL 
*************************************************** ********* 

h u manVR2 aakegkieifrhilqrefsglshlsrkftewcygpvrvslydlasvdsceensvleiiaf 
hVR2. altFL aakegkieifrhilqrefsglshlsrkftewcygpvrvslydlasydsceensvleiiaf 

************************************************************ 

huj^ci nVR2 hcksphrhrm\a^eplnkllqakwdllipkfflnflcnliymfiftavayhqptlkkqaa 
nVR2. altFL hcksphrhrmvvleplnkllqakwdllipkfflnflcnliymfiftavayhqptlkkqaa 

************************************* *********************** 

Dv U /™ a n Y R2 phlkaevgnsmlltghilillggiyllvgqlwyfwrrhvfiwisfidsyfeilflfqall 
hVR2.altFL phlkaevgnsmlltghilillggiyllvgqlwyfwrrhvfiwisfidsyfeilflfqall 

******************************************************** **** 

!™™ an , VR2 twsqvlcflaiewylpllvsalvlgwlnllyytrgfqhtgiysvmiqkvilrdllrfll 
hVR2 . altFL twsqvlcflaiewylpllvsalvlgwlnllyytrgfqhtgiysvmiqk 

******************************************* ****** 

hunxinVR2 iylvflfgfavalvslsqeawrpeaptgpnatesvqpmegqedegngaqyrgileaslel 

!HIIP n yj? 2 FKFTIGMGELAFQEQLHFRGMVLLLLIAYVLLTYILLLNMLIALMSETVNSVATDSWSIW 

hVR2. altFL 

k\/Do an VR 2 KLQKAISVLEMENGYWWCRKKQRAGVMLTVGTKPDGSPDERWCFRVEEVNWASWEQTLPT 

nVKZ.altFL KAISVLEMENGYWWCRKKQRAGVMLTVGTKPDGSPDERWCFRVEEVNWASWEQTLPT 

******************************************* ************** 

hu ma nVR2 lcedpsgagvprtlenpvlasppkededgaseenyvpvqllqsn 
hVR2 .altFL lcedpsgagvprtlenpvlasppkededgaseenyvpvqllqsn 

******************************************** 

Fig. 17 
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SEQUENCE LISTING 
<110> MILLENNIUM PHARMACEUTICALS , INC. 

<120> NOVEL MEMBERS OF THE CAPSAICIN/VANILLOID RECEPTOR 
FAMILY OF PROTEINS AND USES THEREOF 

<130> MNI-062CP2PC 



<140> 
<141> 

<150> 60/108,322 
<151> 1998-11-13 

<150> 60/114, 078 
<151> 1998-12-28 

<150> 09/258, 633 
<151> 1999-02-26 

<150> 09/421, 134 
<151> 1999-10-19 

<160> 20 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 3909 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (1113) . . (3629) 
<400> 1 



gtgagcgcaa 


cgcactgcgg 


gcagtgagcg 


caacgcactg 


cgggcagtga 


gcgcaacgca 


60 


ctgcgggcag 


tgagcgcaac 


gcactgcggg 


cagtgagcgc 


aacgcactgc 


gggcagtgag 


120 


cgcaacgcac 


tgcgggcagt 


gagcgcaacg 


cactgcgggc 


agtgagcgca 


acgcacttgc 


180 


gggcagtgag 


cgcaacgcac 


tgcgggcagt 


gagcgcaacg 


cactgcgggc 


agtgagcgca 


240 


acgcactgcg 


ggcagtgagc 


gcaacgcact 


gcgggcagtg 


agcgcaacgc 


actgcgggca 


300 


gtgagcgcaa 


cgcactgcgg 


gcagtgagcg 


caacgcactg 


cgggcagtga 


gcgcaacgca 


360 


ctgcgggcag 


tgagcgcaac 


gcactgcggg 


cagtgagcgc 


aacgcactgc 


gggcagtgag 


420 


cgcaacgcac 


ttaatgtgag 


ttagctcact 


cattaggcac 


cccaggcttt 


acactttatg 


480 


cttccggctc 


gtatgttgtg 


tggaattgtg 


agcggataac 


aatttcacac 


aggaaacagc 


540 


tatgaccatg 


attacgccaa 


gctctaatac 


gactcactat 


agggaaagct 


ggtacgcctg 


600 
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caggtaccgg tccggaattc ccgggtcgac ccacgcgtcc gaaaacacac ctctctgctg 660 

tgggaagact gtgcaatggc acagccgcag agcttggttt gggaggttga agtgctctgg 720 

ggagaattcg tagatcatcc tcagaaaagc cttgccctgg tgttctacca gaaaaacgtc 780 

tcccaatcac ccagaaaagc tgtccacagt agtcccccct tatccacggg tgtcactttc 840 

catgggttca gttatttgcg gtcaaccacg gtctgccaat attaaatgga aaattcttca 900 

aacagttccc aagttttccc ttgtgcattg ttctgagcag tgtgatgaag agtctctgcc 960 

gtgccatctg ggatgcaaac cgtccctgtg tcccccacgt ccaggccgta gatgctcccc 1020 

gccggtcagt cacttagtcg tcagatcgcc cgtcctggta tcacagtgct tctgttcagg 1080 

ttgcacactg ggccacagag gatccagcaa gg atg aag aaa tgg age age aca 1133 

Met Lys Lys Trp Ser Ser Thr 

1 5 

gac ttg ggg aca get gcg gac cca etc caa aag gac ace tgc cca gac 1181 

Asp Leu Gly Thr Ala Ala Asp Pro Leu Gin Lys Asp Thr Cys Pro Asp 

10 15 20 

ccc ctg gat gga gac cct aac tec agg cca cct cca gec aag ccc cag 1229 
Pro Leu Asp Gly Asp Pro Asn Ser Arg Pro Pro Pro Ala Lys Pro Gin 
25 30 35 

etc ccc acg gec aag age cgc ace egg etc ttt ggg aag ggt gac teg 1277 
Leu Pro Thr Ala Lys Ser Arg Thr Arg Leu Phe Gly Lys Gly Asp Ser 
40 45 50 55 

gag gag get ttc ccg gtg gat tgc ccc cac gag gaa ggt gag ttg gac 1325 
Glu Glu Ala Phe Pro Val Asp Cys Pro His Glu Glu Gly Glu Leu Asp 
60 65 70 

tec tgc ccg acc ate aca gtc age cct gtt ate ace ate cag agg cca 1373 
Ser Cys Pro Thr He Thr Val Ser Pro Val He Thr He Gin Arg Pro 
75 80 85 

gga gac ggc ccc acc ggt gee agg ctg ctg tec cag gac tct gtc gec 1421 
Gly Asp Gly Pro Thr Gly Ala Arg Leu Leu Ser Gin Asp Ser Val Ala 
90 95 100 

gec age acc gag aag acc etc agg etc tat gat cgc agg agt ate ttt 14 69 
Ala Ser Thr Glu Lys Thr Leu Arg Leu Tyr Asp Arg Arg Ser He Phe 
105 110 115 

gaa gee gtt get cag aat aac tgc cag gat ctg gag age ctg ctg etc 1517 
Glu Ala Val Ala Gin Asn Asn Cys Gin Asp Leu Glu Ser Leu Leu Leu 
120 125 130 135 

ttc ctg cag aag age aag aag cac etc aca gac aac gag ttc aaa gac 1565 
Phe Leu Gin Lys Ser Lys Lys His Leu Thr Asp Asn Glu Phe Lys Asp 
140 145 150 

cct gag aca ggg aag acc tgt ctg ctg aaa gee atg etc aac ctg cac 1613 
Pro Glu Thr Gly Lys Thr Cys Leu Leu Lys Ala Met Leu Asn Leu His 
155 160 165 
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gac gga cag aac acc acc ate ccc ctg, etc ctg gag ate gcg egg caa 1661 
Asp Gly Gin Asn Thr Thr lie Pro Leu Leu Leu Glu lie Ala Arg Gin 
170 175 180 

acg gac age ctg aag gag ctt gtc aac gec age tac acg gac age tac 1709 
Thr Asp Ser Leu Lys Glu Leu Val Asn Ala Ser Tyr Thr Asp Ser Tyr 
185 190 195 

tac aag ggc cag aca gca ctg cac ate gec ate gag aga cgc aac atg 1757 
Tyr Lys Gly Gin Thr Ala Leu His lie Ala lie Glu Arg Arg Asn Met 
200 205 210 215 



gec ctg gtg acc etc ctg gtg gag aac gga gca gac gtc cag get gcg 
Ala Leu Val Thr Leu Leu Val Glu Asn Gly Ala Asp Val Gin Ala Ala 
220 225 230 



1805 



qcc cat ggg gac ttc ttt aag aaa acc aaa ggg egg cct gga ttc tac 1853 
Ala His Gly Asp Phe Phe Lys Lys Thr Lys Gly Arg Pro Gly Phe Tyr 
235 240 245 

ttc not gaa ctg ccc ctg tec ctg gec gcg tgc acc aac cag ctg ggc 1901 
Phe Gly Glu Leu Pro Leu Ser Leu Ala Ala Cys Thr Asn Gin Leu Gly 
250 255 260 

ate gtg aag ttc ctg ctg cag aac tec tgg cag acg gee gac ate age 194 9 
lie Val Lys Phe Leu Leu Gin Asn Ser Trp Gin Thr Ala Asp lie Ser 
265 270 275 

qcc agg gac teg gtg ggc aac acg gtg ctg cac gee ctg gtg gag gtg 1997 
Ala Arg Asp Ser Val Gly Asn Thr Val Leu His Ala Leu Val Glu Val 
280 285 290 295 

gee gac aac acg gee gac aac acg aag ttt gtg acg age atg tac aat 2045 
Ala Asp Asn Thr Ala Asp Asn Thr Lys Phe Val Thr Ser Met Tyr Asn 
300 305 310 

gag att ctg atg ctg ggg gee aaa ctg cac ccg acg ctg aag ctg gag 2093 
Glu lie Leu Met Leu Gly Ala Lys Leu His Pro Thr Leu Lys Leu Glu 
315 320 325 

gag etc acc aac aag aag gga atg acg ccg ctg get ctg gca get ggg 2141 
Glu Leu Thr Asn Lys Lys Gly Met Thr Pro Leu Ala Leu Ala Ala Gly 
330 335 340 

acc ggg aag ate ggg gtc ttg gec tat att etc cag egg gag ate cag 2189 
Thr Gly Lys lie Gly Val Leu Ala Tyr lie Leu Gin Arg Glu lie Gin 
345 350 355 

gag ccc gag tgc agg cac ctg tec agg aag ttc acc gag tgg gee tac 2237 
Glu Pro Glu Cys Arg His Leu Ser Arg Lys Phe Thr Glu Trp Ala Tyr 
360 365 370 375 

ggg ccc gtg cac tec teg ctg tac gac ctg tec tgc ate gac acc tgc 2285 
Gly Pro Val His Ser Ser Leu Tyr Asp Leu Ser Cys lie Asp Thr Cys 
380 385 390 

gag aag aac teg gtg ctg gag gtg ate gee tac age age age gag acc 2333 
Glu Lys Asn Ser Val Leu Glu Val lie Ala Tyr Ser Ser Ser Glu Thr 
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395 400 405 

cct aat cgc cac gac atg etc ttg gtg gag ccg ctg aac cga etc ctg 2381 
Pro Asn Arg His Asp Met Leu Leu Val Glu Pro Leu Asn Arg Leu Leu 
410 415 420 



cag gac aag tgg gac aga ttc gtc aag cgc ate ttc tac ttc aac ttc 2429 
Gin Asp Lys Trp Asp Arg Phe Val Lys Arg lie Phe Tyr Phe Asn Phe 
425 430 435 

ctg gtc tac tgc ctg tac atg ate ate ttc ace atg get gec tac tac 2477 
Leu Val Tyr Cys Leu Tyr Met lie lie Phe Thr Met Ala Ala Tyr Tyr 
440 445 450 455 

agg ccc gtg gat ggc ttg cct ccc ttt aag atg gaa aaa att gga gac 2525 
Arg Pro Val Asp Gly Leu Pro Pro Phe Lys Met Glu Lys lie Gly Asp 
460 465 470 

tat ttc cga gtt act gga gag ate ctg tct gtg tta gga gga gtc tac 2573 
Tyr Phe Arg Val Thr Gly Glu lie Leu Ser Val Leu Gly Gly Val Tyr 
475 480 485 

ttc ttt ttc cga ggg att cag tat ttc ctg cag agg egg ccg teg atg 2621 
Phe Phe Phe Arg Gly lie Gin Tyr Phe Leu Gin Arg Arg Pro Ser Met 
490 495 500 

aag ace ctg ttt gtg gac age tac agt gag atg ctt ttc ttt ctg cag 2669 
Lys Thr Leu Phe Val Asp Ser Tyr Ser Glu Met Leu Phe Phe Leu Gin 
505 510 515 

tea ctg ttc atg ctg gee ace gtg gtg ctg tac ttc age cac etc aag 2717 
Ser Leu Phe Met Leu Ala Thr Val Val Leu Tyr Phe Ser His Leu Lys 
520 525 530 535 

gag tat gtg get tec atg gta ttc tec ctg gee ttg ggc tgg ace aac 2765 
Glu Tyr Val Ala Ser Met Val Phe Ser Leu Ala Leu Gly Trp Thr Asn 
540 545 550 

atg etc tac tac ace cgc ggt ttc cag cag atg ggc ate tat gee gtc 2813 
Met Leu Tyr Tyr Thr Arg Gly Phe Gin Gin Met Gly lie Tyr Ala Val 
555 560 565 

atg ata gag aag atg ate ctg aga gac ctg tgc cgt ttc atg ttt gtc 2861 
Met lie Glu Lys Met lie Leu Arg Asp Leu Cys Arg Phe Met Phe Val 
570 575 580 

tac ate gtc ttc ttg ttc ggg ttt tec aca gcg gtg gtg acg ctg att 2909 
Tyr lie Val Phe Leu Phe Gly Phe Ser Thr Ala Val Val Thr Leu He 
585 590 595 

gaa gac ggg aag aat gac tec ctg ccg tct gag tec acg teg cac agg 2957 
Glu Asp Gly Lys Asn Asp Ser Leu Pro Ser Glu Ser Thr Ser His Arg 
600 605 610 615 

tgg egg ggg cct gee tgc agg ccc ccc gat age tec tac aac age ctg 3005 
Trp Arg Gly Pro Ala Cys Arg Pro Pro Asp Ser Ser Tyr Asn Ser Leu 
620 625 630 
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tac tec acc tgc ctg gag ctg ttc aag ttc acc ate ggc atg ggc gac 3053 

Tyr Ser Thr Cys Leu Glu Leu Phe Lys Phe Thr He Gly Met Gly Asp 

635 640 645 

ctg gag ttc act gag aac tat gac ttc aag get gtc ttc ate ate ctg 3101 

Leu Glu Phe Thr Glu Asn Tyr Asp Phe Lys Ala Val Phe He He Leu 

650 655 660 



ctg ctg gec tat gta att etc acc tac ate etc ctg etc aac atg etc 
Leu Leu Ala Tyr Val He Leu Thr Tyr He Leu Leu Leu Asn Met Leu 
665 670 675 



3149 



ate gec etc atg ggt gag act gtc aac aag ate gca cag gag age aag 
He Ala Leu Met Gly Glu Thr Val Asn Lys He Ala Gin Glu Ser Lys 
680 685 690 695 



3197 



aac ate tgg aag ctg cag aga gec ate acc ate ctg gac acg gag aag 
Asn He Trp Lys Leu Gin Arg Ala lie Thr lie Leu Asp Thr Glu Lys 
700 705 710 



3245 



age ttc ctt aag tgc atg agg aag gee ttc cgc tea ggc aag ctg ctg 
Ser Phe Leu Lys Cys Met Arg Lys Ala Phe Arg Ser Gly Lys Leu Leu 
715 720 725 



3293 



cag gtg ggg tac aca cct gat ggc aag gac gac tac egg tgg tgc ttc 
Gin Val Gly Tyr Thr Pro Asp Gly Lys Asp Asp Tyr Arg Trp Cys Phe 
730 735 740 



3341 



agg gtg gac gag gtg aac tgg acc acc tgg aac acc aac gtg ggc ate 
Arg Val Asp Glu Val Asn Trp Thr Thr Trp Asn Thr Asn Val Gly He 
745 750 755 



3389 



ate aac gaa gac ccg ggc aac tgt gag ggc gtc aag cgc acc ctg age 
He Asn Glu Asp Pro Gly Asn Cys Glu Gly Val Lys Arg Thr Leu Ser 
760 765 770 775 



3437 



ttc tec ctg egg tea age aga gtt tea ggc aga cac tgg aag aac ttt 
Phe Ser Leu Arg Ser Ser Arg Val Ser Gly Arg His Trp Lys Asn Phe 
780 785 790 



3485 



gee ctg gtc ccc ctt tta aga gag gca agt get cga gat agg cag tct 
Ala Leu Val Pro Leu Leu Arg Glu Ala Ser Ala Arg Asp Arg Gin Ser 
795 800 805 



3533 



get cag ccc gag gaa gtt tat ctg cga cag ttt tea ggg tct ctg aag 
Ala Gin Pro Glu Glu Val Tyr Leu Arg Gin Phe Ser Gly Ser Leu Lys 
810 815 820 



3581 



cca gag gac get gag gtc ttc aag agt cct gee get tec ggg gag aag 
Pro Glu Asp Ala Glu Val Phe Lys Ser Pro Ala Ala Ser Gly Glu Lys 
825 830 835 



3629 



tgaggacgtc aegcagacag cactgtcaac actgggcett aggagacccc gttgecaegg 3689 
ggggctgctg agggaacacc agtgctctgt cagcagcctg gcctggtctg tgcctgccca 3749 
gcatgttccc aaatctgtgc tggacaagct gtgggaagcg ttcttggaag catggggagt 3809 
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gatgtacatc caaccgtcac tgtccccaag tgaatctcct aacagacttt caggttttta 3869 
ctcactttac taaaaaaaaa aaaaaaaggg cggccgctta 3909 



<210> 2 
<211> 839 
<212> PRT 

<213> Homo sapiens 
<400> 2 

Met Lys Lys Trp Ser Ser Thr Asp Leu Gly Thr Ala Ala Asp Pro Leu 
1 5 10 15 

Gin Lys Asp Thr Cys Pro Asp Pro Leu Asp Gly Asp Pro Asn Ser Arg 
20 25 30 

Pro Pro Pro Ala Lys Pro Gin Leu Pro Thr Ala Lys Ser Arg Thr Arg 
35 40 45 

Leu Phe Gly Lys Gly Asp Ser Glu Glu Ala Phe Pro Val Asp Cys Pro 
50 55 60 

His Glu Glu Gly Glu Leu Asp Ser Cys Pro Thr lie Thr Val Ser Pro 
65 70 75 80 

Val lie Thr lie Gin Arg Pro Gly Asp Gly Pro Thr Gly Ala Arg Leu 
85 90 95 

Leu Ser Gin Asp Ser Val Ala Ala Ser Thr Glu Lys Thr Leu Arg Leu 
100 105 110 

Tyr Asp Arg Arg Ser lie Phe Glu Ala Val Ala Gin Asn Asn Cys Gin 
115 120 125 

Asp Leu Glu Ser Leu Leu Leu Phe Leu Gin Lys Ser Lys Lys His Leu 
130 135 140 

Thr Asp Asn Glu Phe Lys Asp Pro Glu Thr Gly Lys Thr Cys Leu Leu 
145 150 155 160 

Lys Ala Met Leu Asn Leu His Asp Gly Gin Asn Thr Thr lie Pro Leu 
165 170 175 

Leu Leu Glu lie Ala Arg Gin Thr Asp Ser Leu Lys Glu Leu Val Asn 
180 185 190 

Ala Ser Tyr Thr Asp Ser Tyr Tyr Lys Gly Gin Thr Ala Leu His lie 
195 200 205 

Ala lie Glu Arg Arg Asn Met Ala Leu Val Thr Leu Leu Val Glu Asn 
210 215 220 
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Gly Ala Asp Val Gin Ala Ala Ala His Gly Asp Phe Phe Lys Lys Thr 
225 230 235 240 

Lys Gly Arg Pro Gly Phe Tyr Phe Gly Glu Leu Pro Leu Ser Leu Ala 
245 250 255 

Ala Cys Thr Asn Gin Leu Gly lie Val Lys Phe Leu Leu Gin Asn Ser 
260 265 270 

Trp Gin Thr Ala Asp lie Ser Ala Arg Asp Ser Val Gly Asn Thr Val 
275 280 285 



Leu His Ala Leu 
290 

Phe Val Thr Ser 
305 

His Pro Thr Leu 



Pro Leu Ala Leu 
340 

lie Leu Gin Arg 
355 

Lys Phe Thr Glu 
370 

Leu Ser Cys lie 
385 

Ala Tyr Ser Ser 



Glu Pro Leu Asn 
420 

Arg lie Phe Tyr 
435 

Phe Thr Met Ala 
450 

Lys Met Glu Lys 
465 

Ser Val Leu Gly 



Leu Gin Arg Arg 
500 

Glu Met Leu Phe 
515 

Leu Tyr Phe Ser 



Val Glu Val Ala 
2 95 

Met Tyr Asn Glu 
310 

Lys Leu Glu Glu 
325 

Ala Ala Gly Thr 



Glu lie Gin Glu 
360 

Trp Ala Tyr Gly 
375 

Asp Thr Cys Glu 
390 

Ser Glu Thr Pro 
405 

Arg Leu Leu Gin 



Phe Asn Phe Leu 
440 

Ala Tyr Tyr Arg 
455 

lie Gly Asp Tyr 
470 

Gly Val Tyr Phe 
485 

Pro Ser Met Lys 



Phe Leu Gin Ser 
520 

His Leu Lys Glu 



Asp Asn Thr Ala 
300 

lie Leu Met Leu 
315 

Leu Thr Asn Lys 
330 

Gly Lys lie Gly 
345 

Pro Glu Cys Arg 



Pro Val His Ser 
380 

Lys Asn Ser Val 
395 

Asn Arg His Asp 
410 

Asp Lys Trp Asp 
425 

Val Tyr Cys Leu 



Pro Val Asp Gly 
4 60 

Phe Arg Val Thr 
475 

Phe Phe Arg Gly 
490 

Thr Leu Phe Val 
505 

Leu Phe Met Leu 



Tyr Val Ala Ser 



Asp Asn Thr Lys 



Gly Ala Lys Leu 
320 

Lys Gly Met Thr 
335 

Val Leu Ala Tyr 
350 

His Leu Ser Arg 
365 

Ser Leu Tyr Asp 



Leu Glu Val lie 
400 

Met Leu Leu Val 
415 

Arg Phe Val Lys 
430 

Tyr Met lie He 
445 

Leu Pro Pro Phe 



Gly Glu He Leu 
480 

He Gin Tyr Phe 
495 

Asp Ser Tyr Ser 
510 

Ala Thr Val Val 
525 

Met Val Phe Ser 
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Leu Ala Leu Gly 
545 

Gin Met Gly lie 



Leu Cys Arg Phe 
580 

Thr Ala Val Val 
595 



535 

Trp Thr Asn Met 
550 

Tyr Ala Val Met 
565 

Met Phe Val Tyr 



Thr Leu lie Glu 
600 



- 8- 

540 

Leu Tyr Tyr Thr 
555 

lie Glu Lys Met 
570 

lie Val Phe Leu 
585 

Asp Gly Lys Asn 



Arg Gly Phe Gin 
560 

lie Leu Arg Asp 
575 

Phe Gly Phe Ser 
590 

Asp Ser Leu Pro 
605 



Ser Glu Ser Thr Ser His Arg Trp Arg Gly Pro Ala Cys Arg Pro Pro 
610 615 620 

Asp Ser Ser Tyr Asn Ser Leu Tyr Ser Thr Cys Leu Glu Leu Phe Lys 
625 630 635 640 

Phe Thr lie Gly Met Gly Asp Leu Glu Phe Thr Glu Asn Tyr Asp Phe 
645 650 655 

Lys Ala Val Phe lie lie Leu Leu Leu Ala Tyr Val lie Leu Thr Tyr 
660 665 670 

lie Leu Leu Leu Asn Met Leu lie Ala Leu Met Gly Glu Thr Val Asn 
675 680 685 

Lys lie Ala Gin Glu Ser Lys Asn lie Trp Lys Leu Gin Arg Ala lie 
690 695 700 

Thr lie Leu Asp Thr Glu Lys Ser Phe Leu Lys Cys Met Arg Lys Ala 
705 710 715 720 

Phe Arg Ser Gly Lys Leu Leu Gin Val Gly Tyr Thr Pro Asp Gly Lys 
725 730 735 

Asp Asp Tyr Arg Trp Cys Phe Arg Val Asp Glu Val Asn Trp Thr Thr 
740 745 750 

Trp Asn Thr Asn Val Gly lie lie Asn Glu Asp Pro Gly Asn Cys Glu 
755 760 765 

Gly Val Lys Arg Thr Leu Ser Phe Ser Leu Arg Ser Ser Arg Val Ser 
770 775 780 

Gly Arg His Trp Lys Asn Phe Ala Leu Val Pro Leu Leu Arg Glu Ala 
785 790 795 800 

Ser Ala Arg Asp Arg Gin Ser Ala Gin Pro Glu Glu Val Tyr Leu Arg 
805 810 815 

Gin Phe Ser Gly Ser Leu Lys Pro Glu Asp Ala Glu Val Phe Lys Ser 
820 825 , 830 

Pro Ala Ala Ser Gly Glu Lys 
835 
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<210> 3 
<211> 2517 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . (2517) 

<400> 3 

atg aag aaa tgg age age aca gac ttg ggg aca get gcg gac cca etc 48 
Met Lys Lys Trp Ser Ser Thr Asp Leu Gly Thr Ala Ala Asp Pro Leu 
15 10 15 



caa aag gac ace tgc cca gac ccc ctg gat gga gac cct aac tec agg 96 
Gin Lys Asp Thr Cys Pro Asp Pro Leu Asp Gly Asp Pro Asn Ser Arg 
20 25 30 

cca cct cca gec aag ccc cag etc ccc acg gec aag age cgc acc egg 144 
Pro Pro Pro Ala Lys Pro Gin Leu Pro Thr Ala Lys Ser Arg Thr Arg 
35 40 45 

etc ttt ggg aag ggt gac teg gag gag get ttc ccg gtg gat tgc ccc 192 
Leu Phe Gly Lys Gly Asp Ser Glu Glu Ala Phe Pro Val Asp Cys Pro 
50 55 60 

cac gag gaa ggt gag ttg gac tec tgc ccg acc ate aca gtc age cct 240 
His Glu Glu Gly Glu Leu Asp Ser Cys Pro Thr lie Thr Val Ser Pro 
65 70 75 80 

gtt ate acc ate cag agg cca gga gac ggc ccc acc ggt gee agg ctg 288 
Val lie Thr lie Gin Arg Pro Gly Asp Gly Pro Thr Gly Ala Arg Leu 
85 90 95 

ctg tec cag gac tct gtc gee gec age acc gag aag acc etc agg etc 336 
Leu Ser Gin Asp Ser Val Ala Ala Ser Thr Glu Lys Thr Leu Arg Leu 
100 105 110 

tat gat cgc agg agt ate ttt gaa gec gtt get cag aat aac tgc cag 384 
Tyr Asp Arg Arg Ser lie Phe Glu Ala Val Ala Gin Asn Asn Cys Gin 
115 120 125 

gat ctg gag age ctg ctg etc ttc ctg cag aag age aag aag cac etc 432 
Asp Leu Glu Ser Leu Leu Leu Phe Leu Gin Lys Ser Lys Lys His Leu 
130 135 140 



aca gac aac gag ttc aaa gac cct gag 
Thr Asp Asn Glu Phe Lys Asp Pro Glu 
145 150 

aaa gee atg etc aac ctg cac gac gga 
Lys Ala Met Leu Asn Leu His Asp Gly 
165 

etc ctg gag ate gcg egg caa acg gac 



aca ggg aag acc tgt ctg ctg 480 
Thr Gly Lys Thr Cys Leu Leu 
155 160 

cag aac acc acc ate ccc ctg 528 
Gin Asn Thr Thr lie Pro Leu 
170 175 

age ctg aag gag ctt gtc aac 576 
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Leu Leu Glu He Ala Arg Gin Thr Asp Ser Leu Lys Glu Leu Val Asn 
180 185 190 

gcc age tac acg gac age tac tac aag ggc cag aca gca ctg cac ate 624 
Ala Ser Tyr Thr Asp Ser Tyr Tyr Lys Gly Gin Thr Ala Leu His He 
195 200 205 

gcc ate gag aga cgc aac atg gcc ctg gtg acc etc ctg gtg gag aac 672 
Ala He Glu Arg Arg Asn Met Ala Leu Val Thr Leu Leu Val Glu Asn 
210 215 220 

gga gca gac gtc cag get gcg gcc cat ggg gac ttc ttt aag aaa acc 720 
Gly Ala Asp Val Gin Ala Ala Ala His Gly Asp Phe Phe Lys Lys Thr 
225 230 235 240 

aaa ggg egg cct gga ttc tac ttc ggt gaa ctg ccc ctg tec ctg gcc 768 
Lys Gly Arg Pro Gly Phe Tyr Phe Gly Glu Leu Pro Leu Ser Leu Ala 
245 250 255 

gcg tgc acc aac cag ctg ggc ate gtg aag ttc ctg ctg cag aac tec 816 
Ala Cys Thr Asn Gin Leu Gly He Val Lys Phe Leu Leu Gin Asn Ser 
260 265 270 

tgg cag acg gcc gac ate age gcc agg gac teg gtg ggc aac acg gtg 864 
Trp Gin Thr Ala Asp He Ser Ala Arg Asp Ser Val Gly Asn Thr Val 
275 280 285 

ctg cac gcc ctg gtg gag gtg gcc gac aac acg gcc gac aac acg aag 912 
Leu His Ala Leu Val Glu Val Ala Asp Asn Thr Ala Asp Asn Thr Lys 
290 295 300 

ttt gtg acg age atg tac aat gag att ctg atg ctg ggg gcc aaa ctg 960 
Phe Val Thr Ser Met Tyr Asn Glu He Leu Met Leu Gly Ala Lys Leu 
305 310 315 320 

cac ccg acg ctg aag ctg gag gag etc acc aac aag aag gga atg acg 1008 
His Pro Thr Leu Lys Leu Glu Glu Leu Thr Asn Lys Lys Gly Met Thr 
325 330 335 

ccg ctg get ctg gca get ggg acc ggg aag ate ggg gtc ttg gcc tat 1056 
Pro Leu Ala Leu Ala Ala Gly Thr Gly Lys He Gly Val Leu Ala Tyr 
340 345 350 

att etc cag egg gag ate cag gag ccc gag tgc agg cac ctg tec agg 1104 
He Leu Gin Arg Glu lie Gin Glu Pro Glu Cys Arg His Leu Ser Arg 
355 360 365 

aag ttc acc gag tgg gcc tac ggg ccc gtg cac tec teg ctg tac gac 1152 
Lys Phe Thr Glu Trp Ala Tyr Gly Pro Val His Ser Ser Leu Tyr Asp 
370 375 380 

ctg tec tgc ate gac acc tgc gag aag aac teg gtg ctg gag gtg ate 1200 
Leu Ser Cys lie Asp Thr Cys Glu Lys Asn Ser Val Leu Glu Val He 
385 390 395 400 

gcc tac age age age gag acc cct aat cgc cac gac atg etc ttg gtg 1248 
Ala Tyr Ser Ser Ser Glu Thr Pro Asn Arg His Asp Met Leu Leu Val 
405 410 415 



BNSDOCID: <WO 0Q29577A1JA> 



WO 00/29577 



-11- 



PCT/US99/26701 



gag ccg ctg aac cga etc ctg cag gac aag tgg gac aga ttc gtc aag 1296 
Glu Pro Leu Asn Arg Leu Leu Gin Asp Lys Trp Asp Arg Phe Val Lys 
420 425 430 

cgc ate ttc tac ttc aac ttc ctg gtc tac tgc ctg tac atg ate ate 1344 
Arg lie Phe Tyr Phe Asn Phe Leu Val Tyr Cys Leu Tyr Met lie lie 
435 440 445 

ttc acc atg get gec tac tac agg ccc gtg gat ggc ttg cct ccc ttt 1392 
Phe Thr Met Ala Ala Tyr Tyr Arg Pro Val Asp Gly Leu Pro Pro Phe 
450 455 460 



aag atg gaa aaa att gga gac tat ttc cga gtt act gga gag ate ctg 
Lys Met Glu Lys lie Gly Asp Tyr Phe Arg Val Thr Gly Glu lie Leu 
465 470 475 480 



1440 



tct gtg tta gga gga gtc tac ttc ttt ttc cga ggg att cag tat ttc 1488 
Ser Val Leu Gly Gly Val Tyr Phe Phe Phe Arg Gly lie Gin Tyr Phe 
485 490 495 



ctg cag agg egg ccg teg atg aag acc ctg ttt gtg gac age tac agt 1536 

Leu Gin Arg Arg Pro Ser Met Lys Thr Leu Phe Val Asp Ser Tyr Ser 

500 505 510 

gag atg ctt ttc ttt ctg cag tea ctg ttc atg ctg gee acc gtg gtg 1584 

Glu Met Leu Phe Phe Leu Gin Ser Leu Phe Met Leu Ala Thr Val Val 

515 520 525 



ctg tac ttc age cac etc aag gag tat gtg get tec atg gta ttc tec 
Leu Tyr Phe Ser .His Leu Lys Glu Tyr Val Ala Ser Met Val Phe Ser 
530 535 540 



cag atg ggc ate tat gee gtc atg ata gag aag atg ate ctg aga gac 
Gin Met Gly lie Tyr Ala Val Met lie Glu Lys Met He Leu Arg Asp 
565 570 575 



1632 



ctg gec ttg ggc tgg acc aac atg etc tac tac acc cgc ggt ttc cag 1680 
Leu Ala Leu Gly Trp Thr Asn Met Leu Tyr Tyr Thr Arg Gly Phe Gin 
545 550 555 560 



1728 



ctg tgc cgt ttc atg ttt gtc tac ate gtc ttc ttg ttc ggg ttt tec 1776 
Leu Cys Arg Phe Met Phe Val Tyr He Val Phe Leu Phe Gly Phe Ser 
580 585 590 



aca gcg gtg gtg acg ctg 
Thr Ala Val Val Thr Leu 
595 

tct gag tec acg teg cac 
Ser Glu Ser Thr Ser His 
610 

gat age tec tac aac age 
Asp Ser Ser Tyr Asn Ser 
625 630 

ttc acc ate ggc atg ggc 



att gaa gac ggg aag aat 
He Glu Asp Gly Lys Asn 
600 

agg tgg egg ggg cct gec 
Arg Trp Arg Gly Pro Ala 
615 620 

ctg tac tec acc tgc ctg 
Leu Tyr Ser Thr Cys Leu 
635 

gac ctg gag ttc act gag 



gac tec ctg ccg 1824 

Asp Ser Leu Pro 

605 

tgc agg ccc ccc 1872 

Cys Arg Pro Pro 



gag ctg ttc aag 1920 
Glu Leu Phe Lys 
640 

aac tat gac ttc 1968 
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Phe Thr lie Gly Met Gly Asp Leu Glu Phe Thr Glu Asn Tyr Asp Phe 
645 650 655 

aag get gtc ttc ate ate ctg ctg ctg gec tat gta att etc acc tac 2016 
Lys Ala Val Phe lie lie Leu Leu Leu Ala Tyr Val lie Leu Thr Tyr 
660 665 670 

ate etc ctg etc aac atg etc ate gee etc atg ggt gag act gtc aac 2064 
lie Leu Leu Leu Asn Met Leu lie Ala Leu Met Gly Glu Thr Val Asn 
675 680 685 

aag ate gca cag gag age aag aac ate tgg aag ctg cag aga gec ate 2112 
Lys lie Ala Gin Glu Ser Lys Asn lie Trp Lys Leu Gin Arg Ala lie 
690 695 700 

acc ate ctg gac acg gag aag age ttc ctt aag tgc atg agg aag gee 2160 
Thr lie Leu Asp Thr Glu Lys Ser Phe Leu Lys Cys Met Arg Lys Ala 
705 710 715 720 

ttc cgc tea ggc aag ctg ctg cag gtg ggg tac aca cct gat ggc aag 2208 
Phe Arg Ser Gly Lys Leu Leu Gin Val Gly Tyr Thr Pro Asp Gly Lys 
725 730 735 

gac gac tac egg tgg tgc ttc agg gtg gac gag gtg aac tgg acc acc 2256 
Asp Asp Tyr Arg Trp Cys Phe Arg Val Asp Glu Val Asn Trp Thr Thr 
740 745 750 

tgg aac acc aac gtg ggc ate ate aac gaa gac ccg ggc aac tgt gag 2304 
Trp Asn Thr Asn Val Gly lie He Asn Glu Asp Pro Gly Asn Cys Glu 
755 760 765 

ggc gtc aag cgc acc ctg age ttc tec ctg egg tea age aga gtt tea 2352 
Gly Val Lys Arg Thr Leu Ser Phe Ser Leu Arg Ser Ser Arg Val Ser 
770 775 780 

ggc aga cac tgg aag aac ttt gee ctg gtc ccc ctt tta aga gag gca 2400 
Gly Arg His Trp Lys Asn Phe Ala Leu Val Pro Leu Leu Arg Glu Ala 
785 790 795 800 

agt get cga gat agg cag tct get cag ccc gag gaa gtt tat ctg cga 2448 
Ser Ala Arg Asp Arg Gin Ser Ala Gin Pro Glu Glu Val Tyr Leu Arg 
805 810 815 



cag ttt tea ggg tct ctg aag cca gag gac get gag gtc ttc aag agt 2496 
Gin Phe Ser Gly Ser Leu Lys Pro Glu Asp Ala Glu Val Phe Lys Ser 
820 825 830 

cct gee get tec ggg gag aag 2517 
Pro Ala Ala Ser Gly Glu Lys 
835 

<210> 4 

<211> 2809 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 



BNSDOCID: <WO 0029577A1 JA> 



WO 00/29577 PCT/US99/26701 

- 13- 

<222> (361) . . (2652) 
<400> 4 

ggctagcctg tcctgacagg ggagagttaa gctcccgttc tccaccgtgc cggctggcca 60 

ggtgggctga gggtgaccga gagaccagaa cctgcttgct ggagcttagt gctcagagct 120 

ggggagggag gttccgccgc tcctctgctg tcagcgccgg cagcccctcc cggcttcact 180 

tcctcccgca gcccctgcta ctgagaagct ccgggatccc agcagccgcc acgccctggc 24 0 

ctcagcctgc ggggctccag tcaggccaac accgacgcgc agctgggagg aagacaggac 300 

ccttgacatc tccatctgca cagaggtcct ggctggaccg agcagcctcc tcctcctagg 360 



atg acc tea ccc tec age tct cca gtt ttc agg ttg gag aca tta gat 
Met Thr Ser Pro Ser Ser Ser Pro Val Phe Arg Leu Glu Thr Leu Asp 
15 10 15 



408 



gga ggc caa gaa gat ggc tct gag gcg gac aga gga aag ctg gat ttt 456 
Gly Gly Gin Glu Asp Gly Ser Glu Ala Asp Arg Gly Lys Leu Asp Phe 
20 25 30 



ggg age ggg ctg cct ccc atg gag tea cag ttc cag ggc gag gac egg 
Gly Ser Gly Leu Pro Pro Met Glu Ser Gin Phe Gin Gly Glu Asp Arg 
35 40 45 



504 



aaa ttc gee cct cag ata aga gtc aac etc aac tac cga aag gga aca 552 
Lys Phe Ala Pro Gin lie Arg Val Asn Leu Asn Tyr Arg Lys Gly Thr 
50 55 60 



ggt gee agt cag ccg gat cca aac cga ttt gac cga gat egg etc ttc 
Gly Ala Ser Gin Pro Asp Pro Asn Arg Phe Asp Arg Asp Arg Leu Phe 
65 70 75 80 



600 



aat gcg gtc tec egg ggt gtc ccc gag gat ctg get gga ctt cca gag 648 
Asn Ala Val Ser Arg Gly Val Pro Glu Asp Leu Ala Gly Leu Pro Glu 
85 90 95 

tac ctg age aag acc age aag tac etc acc gac teg gaa tac aca gag 696 
Tyr Leu Ser Lys Thr Ser Lys Tyr Leu Thr Asp Ser Glu Tyr Thr Glu 
100 105 110 

ggc tec aca ggt aag acg tgc ctg atg aag get gtg ctg aac ctt aag 744 
Gly Ser Thr Gly Lys Thr Cys Leu Met Lys Ala Val Leu Asn Leu Lys 
115 120 125 

gac gga gtc aat gec tgc att ctg cca ctg ctg cag ate gac agg gac 792 
Asp Gly Val Asn Ala Cys lie Leu Pro Leu Leu Gin lie Asp Arg Asp 
130 135 140 

tct ggc aat cct cag ccc ctg gta aat gee cag tgc aca gat gac tat 840 
Ser Gly Asn Pro Gin Pro Leu Val Asn Ala Gin Cys Thr Asp Asp Tyr 
145 150 155 160 

tac cga ggc cac age get ctg cac ate gee att gag aag agg agt ctg 888 
Tyr Arg Gly His Ser Ala Leu His lie Ala lie Glu Lys Arg Ser Leu 
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165 170 175 

cag tgt gtg aag etc ctg gtg gag aat ggg gec aat gtg cat gec egg 936 
Gin Cys Val Lys Leu Leu Val Glu Asn Gly Ala Asn Val His Ala Arg 
180 185 190 

gec tgc ggc cgc ttc ttc cag aag ggc caa ggg act tgc ttt tat ttc 984 
Ala Cys Gly Arg Phe Phe Gin Lys Gly Gin Gly Thr Cys Phe Tyr Phe 
195 200 205 

ggt gag eta ccc etc tct ttg gec get tgc acc aag cag tgg gat gtg 1032 
Gly Glu Leu Pro Leu Ser Leu Ala Ala Cys Thr Lys Gin Trp Asp Val 
210 215 220 

gta age tac etc ctg gag aac cca cac cag ccc gee age ctg cag gee 1080 
Val Ser Tyr Leu Leu Glu Asn Pro His Gin Pro Ala Ser Leu Gin Ala 
225 230 235 240 

act gac tec cag ggc aac aca gtc ctg cat gec eta gtg atg ate teg 1128 
Thr Asp Ser Gin Gly Asn Thr Val Leu His Ala Leu Val Met lie Ser 
245 250 255 

gac aac tea get gag aac att gca ctg gtg acc age atg tat gat ggg 1176 
Asp Asn Ser Ala Glu Asn lie Ala Leu Val Thr Ser Met Tyr Asp Gly 
260 265 270 



etc etc caa get ggg gec cgc etc tgc cct acc gtg cag ctt gag gac 1224 
Leu Leu Gin Ala Gly Ala Arg Leu Cys Pro Thr Val Gin Leu Glu Asp 
275 280 285 

ate cgc aac ctg cag gat etc acg cct ctg aag ctg gee gec aag gag 1272 
lie Arg Asn Leu Gin Asp Leu Thr Pro Leu Lys Leu Ala Ala Lys Glu 
290 295 300 

ggc aag ate gag att ttc agg cac ate ctg cag egg gag ttt tea gga 1320 
Gly Lys lie Glu lie Phe Arg His lie Leu Gin Arg Glu Phe Ser Gly 
305 310 315 320 

ctg age cac ctt tec cga aag ttc acc gag tgg tgc tat ggg cct gtc 1368 
Leu Ser His Leu Ser Arg Lys Phe Thr Glu Trp Cys Tyr Gly Pro Val 
325 330 335 

egg gtg teg ctg tat gac ctg get tct gtg gac age tgt gag gag aac 1416 
Arg Val Ser Leu Tyr Asp Leu Ala Ser Val Asp Ser Cys Glu Glu Asn 
340 345 350 

tea gtg ctg gag ate att gec ttt cat tgc aag age ccg cac cga cac 14 64 
Ser Val Leu Glu lie lie Ala Phe His Cys Lys Ser Pro His Arg His 
355 360 365 

cga atg gtc gtt ttg gag ccc ctg aac aaa ctg ctg cag gcg aaa tgg 1512 
Arg Met Val Val Leu Glu Pro Leu Asn Lys Leu Leu Gin Ala Lys Trp 
370 375 380 

gat ctg etc ate ccc aag ttc ttc tta aac ttc ctg tgt aat ctg ate 1560 
Asp Leu Leu lie Pro Lys Phe Phe Leu Asn Phe Leu Cys Asn Leu lie 
385 390 395 400 
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tac atg ttc ate ttc acc get gtt gee tac cat cag cct ace ctg aag 1608 
Tyr Met Phe lie Phe Thr Ala Val Ala Tyr His Gin Pro Thr Leu Lys 
405 410 415 

aag cag gee gee cct cac ctg aaa gcg gag gtt gga aac tec atg ctg 1656 
Lys Gin Ala Ala Pro His Leu Lys Ala Glu Val Gly Asn Ser Met Leu 
420 425 430 

ctg acg ggc cac ate ctt ate ctg eta ggg ggg ate tac etc etc gtg 1704 
Leu Thr Gly His lie Leu lie Leu Leu Gly Gly lie Tyr Leu Leu Val 
435 440 445 

ggc cag ctg tgg tac ttc tgg egg cgc cac gtg ttc ate tgg ate teg 1752 
Gly Gin Leu Trp Tyr Phe Trp Arg Arg His Val Phe lie Trp lie Ser 
450 455 460 

ttc ata gac age tac ttt gaa ate etc ttc ctg ttc cag gee ctg etc 1800 
Phe He Asp Ser Tyr Phe Glu He Leu Phe Leu Phe Gin Ala Leu Leu 
465 470 475 480 

aca gtg gtg tec cag gtg ctg tgt ttc ctg gec ate gag tgg tac ctg 1848 
Thr Val Val Ser Gin Val Leu Cys Phe Leu Ala He Glu Trp Tyr Leu 
485 490 495 

ccc ctg ctt gtg tct gcg ctg gtg ctg ggc tgg ctg aac ctg ctt tac 1896 
Pro Leu Leu Val Ser Ala Leu Val Leu Gly Trp Leu Asn Leu Leu Tyr 
500 505 510 



tat aca cgt ggc ttc cag cac aca ggc ate tac agt gtc atg ate cag 1944 

Tyr Thr Arg Gly Phe Gin His Thr Gly He Tyr Ser Val Met He Gin 

515 520 525 

aag gtc ate ctg egg gac ctg ctg cgc ttc ctt ctg ate tac tta gtc 1992 

Lys Val He Leu Arg Asp Leu Leu Arg Phe Leu Leu He Tyr Leu Val 

530 535 540 

ttc ctt ttc ggc ttc get gta gee ctg gtg age ctg age cag gag get 2040 

Phe Leu Phe Gly Phe Ala Val Ala Leu Val Ser Leu Ser Gin Glu Ala 

545 550 555 560 



tgg cgc ccc gaa get cct aca ggc ccc aat gee aca gag tea gtg cag 2088 

Trp Arg Pro Glu Ala Pro Thr Gly Pro Asn Ala Thr Glu Ser Val Gin 
565 570 575 

ccc atg gag gga cag gag gac gag ggc aac ggg gee cag tac agg ggt 2136 

Pro Met Glu Gly Gin Glu Asp Glu Gly Asn Gly Ala Gin Tyr Arg Gly 
580 585 590 

ate ctg gaa gee tec ttg gag etc ttc aaa ttc acc ate ggc atg ggc 2184 

He Leu Glu Ala Ser Leu Glu Leu Phe Lys Phe Thr He Gly Met Gly 

595 600 605 

gag ctg gee ttc cag gag cag ctg cac ttc cgc ggc atg gtg ctg ctg 2232 

Glu Leu Ala Phe Gin Glu Gin Leu His Phe Arg Gly Met Val Leu Leu 

610 615 620 

ctg ctg ctg gee tac gtg ctg etc acc tac ate ctg ctg etc aac atg 2280 
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Leu Leu Leu Ala Tyr Val Leu Leu Thr Tyr lie Leu Leu Leu Asn Met 
625 630 635 640 

etc ate gee etc atg age gag ace gtc aac agt gtc gee act gac age 2328 
Leu lie Ala Leu Met Ser Glu Thr Val Asn Ser Val Ala Thr Asp Ser 
645 650 655 

tgg age ate tgg aag ctg cag aaa gec ate tct gtc ctg gag atg gag 2376 
Trp Ser lie Trp Lys Leu Gin Lys Ala lie Ser Val Leu Glu Met Glu 
660 665 670 

aat ggc tat tgg tgg tgc agg aag aag cag egg gca ggt gtg atg ctg 2424 
Asn Gly Tyr Trp Trp Cys Arg Lys Lys Gin Arg Ala Gly Val Met Leu 
675 680 685 

acc gtt ggc act aag cca gat ggc age ccg gat gag cgc tgg tgc ttc 2472 
Thr Val Gly Thr Lys Pro Asp Gly Ser Pro Asp Glu Arg Trp Cys Phe 
690 695 700 

agg gtg gag gag gtg aac tgg get tea tgg gag cag acg ctg cct acg 2520 
Arg Val Glu Glu Val Asn Trp Ala Ser Trp Glu Gin Thr Leu Pro Thr 
705 710 715 720 

ctg tgt gag gac ccg tea ggg gca ggt gtc cct cga act etc gag aac "2568 
Leu Cys Glu Asp Pro Ser Gly Ala Gly Val Pro Arg Thr Leu Glu Asn 
725 730 735 

cct gtc ctg get tec cct ccc aag gag gat gag gat ggt gee tct gag 2616 
Pro Val Leu Ala Ser Pro Pro Lys Glu Asp Glu Asp Gly Ala Ser Glu 
740 745 750 

gaa aac tat gtg ccc gtc cag etc etc cag tec aac tgatggccca 2662 
Glu Asn Tyr Val Pro Val Gin Leu Leu Gin Ser Asn 
755 760 

gatgeagcag gaggecagag gacagagcag aggatctttc caaccacatc tgctggctct 2722 

ggggtcccag tgaattctgg tggcaaatat atattttcac taactcaaaa aaaaaaaaaa 2782 

aaaaaaaaaa aaaaaaaaaa aaaaaaa 2809 



<210> 5 
<211> 764 
<212> PRT 

<213> Homo sapiens 
<400> 5 

Met Thr Ser Pro Ser Ser Ser Pro Val Phe Arg Leu Glu Thr Leu Asp 
15 10 15 

Gly Gly Gin Glu Asp Gly Ser Glu Ala Asp Arg Gly Lys Leu Asp Phe 
20 25 30 

Gly Ser Gly Leu Pro Pro Met Glu Ser Gin Phe Gin Gly Glu Asp Arg 
35 40 45 

Lys Phe Ala Pro Gin lie Arg Val Asn Leu Asn Tyr Arg Lys Gly Thr 
50 55 60 
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Gly Ala Ser Gin 
65 

Asn Ala Val Ser 



Tyr Leu Ser Lys 
100 

Gly Ser Thr Gly 
115 

Asp Gly Val Asn 
130 

Ser Gly Asn Pro 
145 

T y r A r n Gly His 



Gin Cys Val Lys 
180 

Ala Cys Gly Arg 
195 

Gly Glu Leu Pro 
210 

Val Ser Tyr Leu 
225 

Thr Asp Ser Gin 



Pro Asp Pro Asn 
70 

Arg Gly Val Pro 
85 

Thr Ser Lys Tyr 



Lys Thr Cys Leu 
120 

Ala Cys lie Leu 
135 

Gin Pro Leu Val 
150 

Ser Ala Leu His 
165 

Leu Leu Val Glu 



Phe Phe Gin Lys 
200 

Leu Ser Leu Ala 
215 

Leu Glu Asn Pro 
230 

Gly Asn Thr Val 
245 



Arg Phe Asp Arg 
75 

Glu Asp Leu Ala 
90 

Leu Thr Asp Ser 
105 

Met Lys Ala Val 



Pro Leu Leu Gin 
140 

Asn Ala Gin Cys 
155 

He Ala He Glu 
170 

Asn Gly Ala Asn 
185 

Gly Gin Gly Thr 



Ala Cys Thr Lys 
220 

His Gin Pro Ala 
235 

Leu His Ala Leu 
250 



Asp Arg Leu Phe 
80 

Gly Leu Pro Glu 
95 

Glu Tyr Thr Glu 
110 

Leu Asn Leu Lys 
125 

He Asp Arg Asp 



Thr Asp Asp Tyr 
160 

Lys Arg Ser Leu 
175 

Val His Ala Arg 
190 

Cys Phe Tyr Phe 
205 

Gin Trp Asp Val 



Ser Leu Gin Ala 
240 

Val Met He Ser 
255 



Asp Asn Ser Ala Glu 
260 

Leu Leu Gin Ala Gly 
275 

lie Arg Asn Leu Gin 
290 

Gly Lys He Glu He 
305 

Leu Ser His Leu Ser 
325 

Arg Val Ser Leu Tyr 
340 

Ser Val Leu Glu He 
355 



Asn He Ala Leu Val 
265 

Ala Arg Leu Cys Pro 
280 

Asp Leu Thr Pro Leu 
295 

Phe Arg His He Leu 
310 

Arg Lys Phe Thr Glu 
330 

Asp Leu Ala Ser Val 
345 

He Ala Phe His Cys 
360 



Thr Ser Met Tyr Asp Gly 
270 

Thr Val Gin Leu Glu Asp 
285 

Lys Leu Ala Ala Lys Glu 
300 

Gin Arg Glu Phe Ser Gly 
315 320 

Trp Cys Tyr Gly Pro Val 
335 

Asp Ser Cys Glu Glu Asn 
350 

Lys Ser Pro His Arg His 
365 
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Arg Met Val Val 
370 

Asp Leu Leu lie 
385 

Tyr Met Phe lie 



Lys Gin Ala Ala 
420 

Leu Thr Gly His 
435 

Gly Gin Leu Trp 
450 

Phe lie Asp Ser 
465 

Thr Val Val Ser 



Pro Leu Leu Val 
500 

Tyr Thr Arg Gly 
515 

Lys Val lie Leu 
530 

Phe Leu Phe Gly 
545 

Trp Arg Pro Glu 



Pro Met Glu Gly 
580 

lie Leu Glu Ala 
595 

Glu Leu Ala Phe 
610 

Leu Leu Leu Ala 
625 

Leu lie Ala Leu 



Trp Ser lie Trp 
660 

Asn Gly Tyr Trp 
675 



Leu Glu Pro Leu 
375 

Pro Lys Phe Phe 
390 

Phe Thr Ala Val 
405 

Pro His Leu Lys 



lie Leu lie Leu 
440 

Tyr Phe Trp Arg 
455 

Tyr Phe Glu lie 
470 

Gin Val Leu Cys 
485 

Ser Ala Leu Val 



Phe Gin His Thr 
520 

Arg Asp Leu Leu 
535 

Phe Ala Val Ala 
550 

Ala Pro Thr Gly 
565 

Gin Glu Asp Glu 



Ser Leu Glu Leu 
600 

Gin Glu Gin Leu 
615 

Tyr Val Leu Leu 
630 

Met Ser Glu Thr 
645 

Lys Leu Gin Lys 



Trp Cys Arg Lys 
680 



Asn Lys Leu Leu 
380 

Leu Asn Phe Leu 
395 

Ala Tyr His Gin 
410 

Ala Glu Val Gly 
425 

Leu Gly Gly lie 



Arg His Val Phe 
460 

Leu Phe Leu Phe 
475 

Phe Leu Ala lie 
490 

Leu Gly Trp Leu 
505 

Gly lie Tyr Ser 



Arg Phe Leu Leu 
540 

Leu Val Ser Leu 
555 

Pro Asn Ala Thr 
570 

Gly Asn Gly Ala 
585 

Phe Lys Phe Thr 



His Phe Arg Gly 
620 

Thr Tyr lie Leu 
635 

Val Asn Ser Val 
650 

Ala lie Ser Val 
665 

Lys Gin Arg Ala 



Gin Ala Lys Trp 



Cys Asn Leu lie 
400 

Pro Thr Leu Lys 
415 

Asn Ser Met Leu 
430 

Tyr Leu Leu Val 
445 

lie Trp lie Ser 



Gin Ala Leu Leu 
480 

Glu Trp Tyr Leu 
4 95 

Asn Leu Leu Tyr 
510 

Val Met lie Gin 

525 

lie Tyr Leu Val 



Ser Gin Glu Ala 
560 

Glu Ser Val Gin 
575 

Gin Tyr Arg Gly 
590 

lie Gly Met Gly 
605 

Met Val Leu Leu 



Leu Leu Asn Met 
640 

Ala Thr Asp Ser 
655 

Leu Glu Met Glu 
670 

Gly Val Met Leu 
685 
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Thr Val Gly Thr 
690 

Arg Val Glu Glu 
705 

Leu Cys Glu Asp 



Pro Val Leu Ala 
740 

Glu Asn Tyr Val 
755 



Lys Pro Asp Gly 
695 

Val Asn Trp Ala 
710 

Pro Ser Gly Ala 
725 

Ser Pro Pro Lys 



Pro Val Gin Leu 
760 



Ser Pro Asp Glu 
700 

Ser Trp Glu Gin 
715 

Gly Val Pro Arg 
730 

Glu Asp Glu Asp 
745 

Leu Gin Ser Asn 



Arg Trp Cys Phe 



Thr Leu Pro Thr 
720 

Thr Leu Glu Asn 
735 

Gly Ala Ser Glu 
750 



<210> 6 

<211> 2292 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . (2292) 

<400> 6 

atg acc tea ccc tec age tct cca gtt ttc agg ttg gag aca tta gat 48 

Met Thr Ser Pro Ser Ser Ser Pro Val Phe Arg Leu Glu Thr Leu Asp 

15 10 15 

gga ggc caa gaa gat ggc tct gag gcg gac aga gga aag ctg gat ttt 96 
Gly Gly Gin Glu Asp Gly Ser Glu Ala Asp Arg Gly Lys Leu Asp Phe 
20 25 30 



ggg age ggg ctg cct ccc atg gag tea cag ttc cag ggc gag gac egg 144 
Gly Ser Gly Leu Pro Pro Met Glu Ser Gin Phe Gin Gly Glu Asp Arg 
35 40 45 

aaa ttc gee cct cag ata aga gtc aac etc aac tac cga aag gga aca 192 
Lys Phe Ala Pro Gin He Arg Val Asn Leu Asn Tyr Arg Lys Gly Thr 
50 55 60 

ggt gee agt cag ccg gat cca aac cga ttt gac cga gat egg etc ttc 240 
Gly Ala Ser Gin Pro Asp Pro Asn Arg Phe Asp Arg Asp Arg Leu Phe 
65 70 75 80 

aat gcg gtc tec egg ggt gtc ccc gag gat ctg get gga ctt cca gag 288. 
Asn Ala Val Ser Arg Gly Val Pro Glu Asp Leu Ala Gly Leu Pro Glu 
85 90 95 

tac ctg age aag acc age aag tac etc acc gac teg gaa tac aca gag 336 
Tyr Leu Ser Lys Thr Ser Lys Tyr Leu Thr Asp Ser Glu Tyr Thr Glu 
100 105 110 

ggc tec aca ggt aag acg tgc ctg atg aag get gtg ctg aac ctt aag 384 
Gly Ser Thr Gly Lys Thr Cys Leu Met Lys Ala Val Leu Asn Leu Lys 
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115 120 125 

gac gga gtc aat gcc tgc att ctg cca ctg ctg cag ate gac agg gac 432 
Asp Gly Val Asn Ala Cys lie Leu Pro Leu Leu Gin lie Asp Arg Asp 
130 135 140 

tct ggc aat cct cag ccc ctg gta aat gcc cag tgc aca gat gac tat 480 
Ser Gly Asn Pro Gin Pro Leu Val Asn Ala Gin Cys Thr Asp Asp Tyr 
145 150 155 160 

tac cga ggc cac age get ctg cac ate gcc att gag aag agg agt ctg 528 
Tyr Arg Gly His Ser Ala Leu His lie Ala lie Glu Lys Arg Ser Leu 
165 170 175 

cag tgt gtg aag etc ctg gtg gag aat ggg gcc aat gtg cat gcc egg 576 
Gin Cys Val Lys Leu Leu Val Glu Asn Gly Ala Asn Val His Ala Arg 
180 185 190 

gcc tgc ggc cgc ttc ttc cag aag ggc caa ggg act tgc ttt tat ttc 624 
Ala Cys Gly Arg Phe Phe Gin Lys Gly Gin Gly Thr Cys Phe Tyr Phe 
195 200 205 

ggt gag eta ccc etc tct ttg gcc get tgc ace aag cag tgg gat gtg 672 
Gly Glu Leu Pro Leu Ser Leu Ala Ala Cys Thr Lys Gin Trp Asp Val 
210 215 220 

gta age tac etc ctg gag aac cca cac cag ccc gcc age ctg cag gcc 720 
Val Ser Tyr Leu Leu Glu Asn Pro His Gin Pro Ala Ser Leu Gin Ala 
225 230 235 240 

act gac tec cag ggc aac aca gtc ctg cat gcc eta gtg atg ate teg 768 
Thr Asp Ser Gin Gly Asn Thr Val Leu His Ala Leu Val Met lie Ser 
245 250 255 

gac aac tea get gag aac att gca ctg gtg ace age atg tat gat ggg 816 
Asp Asn Ser Ala Glu Asn lie Ala Leu Val Thr Ser Met Tyr Asp Gly 
260 265 270 



etc etc caa get ggg gcc cgc etc tgc cct ace gtg cag ctt gag gac 864 

Leu Leu Gin Ala Gly Ala Arg Leu Cys Pro Thr Val Gin Leu Glu Asp 

275 280 285 

ate cgc aac ctg cag gat etc acg cct ctg aag ctg gcc gcc aag gag 912 

lie Arg Asn Leu Gin Asp Leu Thr Pro Leu Lys Leu Ala Ala Lys Glu 
290 295 300 

ggc aag ate gag att ttc agg cac ate ctg cag egg gag ttt tea gga 960 

Gly Lys lie Glu lie Phe Arg His lie Leu Gin Arg Glu Phe Ser Gly 
305 310 315 320 

ctg age cac ctt tec cga aag ttc ace gag tgg tgc tat ggg cct gtc 1008 

Leu Ser His Leu Ser Arg Lys Phe Thr Glu Trp Cys Tyr Gly Pro Val 
325 330 335 

egg gtg teg ctg tat gac ctg get tct gtg gac age tgt gag gag aac 1056 

Arg Val Ser Leu Tyr Asp Leu Ala Ser Val Asp Ser Cys Glu Glu Asn 
340 345 350 
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tea gtg ctg gag ate att gec ttt cat tgc aag age ccg cac cga cac 1104 
Ser Val Leu Glu lie lie Ala Phe His Cys Lys Ser Pro His Arg His 
355 360 365 



cga atg gtc gtt ttg gag ccc ctg aac aaa ctg ctg cag gcg aaa tgg 1152 
Arg Met Val Val Leu Glu Pro Leu Asn Lys Leu Leu Gin Ala Lys Trp 
370 375 380 



gat ctg etc ate ccc aag ttc ttc tta aac ttc ctg tgt aat ctg ate 1200 
Asp Leu Leu lie Pro Lys Phe Phe Leu Asn Phe Leu Cys Asn Leu lie 
385 390 395 400 



tac atg ttc ate ttc acc get gtt gec tac cat cag cct acc ctg aag 1248 
Tyr Met Phe lie Phe Thr Ala Val Ala Tyr His Gin Pro Thr Leu Lys 
405 410 415 



aag cag gec gee cct cac ctg aaa gcg gag gtt gga aac tec atg ctg 1296 
Lys Gin Ala Ala Pro His Leu Lys Ala Glu Val Gly Asn Ser Met Leu 
420 425 430 



ctg acg ggc cac ate ctt 
Leu Thr Gly His lie Leu 
435 

ggc cag ctg tgg tac ttc 
Gly Gin Leu Trp Tyr Phe 
450 

ttc ata gac age tac ttt 
Phe lie Asp Ser Tyr Phe 
465 470 



ate ctg eta ggg ggg ate 

He Leu Leu Gly Gly He 
440 

tgg c 99 c 9 c cac gtg ttc 

Trp Arg Arg His Val Phe 

455 460 

gaa ate etc ttc ctg ttc 

Glu He Leu Phe Leu Phe 
475 



tac etc etc gtg 1344 

Tyr Leu Leu Val 

445 

ate tgg ate teg 1392 
He Trp He Ser 



cag gee ctg etc 1440 
Gin Ala Leu Leu 
480 



aca gtg gtg tec cag gtg ctg tgt ttc ctg gee ate gag tgg tac ctg 1488 
Thr Val Val Ser Gin Val Leu Cys Phe Leu Ala He Glu Trp Tyr Leu 
485 490 495 



ccc ctg ctt gtg tct gcg ctg gtg ctg ggc tgg ctg aac ctg ctt tac 1536 
Pro Leu Leu Val Ser Ala Leu Val Leu Gly Trp Leu Asn Leu Leu Tyr 
500 505 510 



tat aca cgt ggc ttc cag 
Tyr Thr Arg Gly Phe Gin 
515 

aag gtc ate ctg egg gac 
Lys Val lie Leu Arg Asp 
530 

ttc ctt ttc ggc ttc get 
Phe Leu Phe Gly Phe Ala 
545 550 



cac aca ggc ate tac agt 

His Thr Gly He Tyr Ser 
520 

ctg ctg cgc ttc ctt ctg 

Leu Leu Arg Phe Leu Leu 

535 540 

gta gee ctg gtg age ctg 

Val Ala Leu Val Ser Leu 
555 



gtc atg ate cag 1584 

Val Met He Gin 

525 

ate tac tta gtc 1632 
He Tyr Leu Val 



age cag gag get 1680 
Ser Gin Glu Ala 
560 



tgg cgc ccc gaa get cct aca ggc ccc 
Trp Arg Pro Glu Ala Pro Thr Gly Pro 
565 



aat gee aca gag tea gtg cag 1728 
Asn Ala Thr Glu Ser Val Gin 
570 575 



ccc atg gag gga cag gag gac gag ggc aac ggg gee cag tac agg ggt 1776 
Pro Met Glu Gly Gin Glu Asp Glu Gly Asn Gly Ala Gin Tyr Arg Gly 
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580 585 590 

ate ctg gaa gec tec ttg gag etc ttc aaa ttc acc ate ggc atg ggc 1824 
lie Leu Glu Ala Ser Leu Glu Leu Phe Lys Phe Thr lie Gly Met Gly 
595 600 605 

gag ctg gec ttc cag gag cag ctg cac ttc cgc ggc atg gtg ctg ctg 1872 
Glu Leu Ala Phe Gin Glu Gin Leu His Phe Arg Gly Met Val Leu Leu 
610 615 620 

ctg ctg ctg gec tac gtg ctg etc acc tac ate ctg ctg etc aac atg 1920 
Leu Leu Leu Ala Tyr Val Leu Leu Thr Tyr lie Leu Leu Leu Asn Met 
625 630 6*35 640 

etc ate gee etc atg age gag acc gtc aac agt gtc gee act gac age 1968 
Leu lie Ala Leu Met Ser Glu Thr Val Asn Ser Val Ala Thr Asp Ser 
645 650 655 

tgg age ate tgg aag ctg cag aaa gee ate tct gtc ctg gag atg gag 2016 
Trp Ser lie Trp Lys Leu Gin Lys Ala lie Ser Val Leu Glu Met Glu 
660 665 670 

aat ggc tat tgg tgg tgc agg aag aag cag egg gca ggt gtg atg ctg 2064 
Asn Gly Tyr Trp Trp Cys Arg Lys Lys Gin Arg Ala Gly Val Met Leu 
675 680 685 

acc gtt ggc act aag cca gat ggc age ccg gat gag cgc tgg tgc ttc 2112 
Thr Val Gly Thr Lys Pro Asp Gly Ser Pro Asp Glu Arg Trp Cys Phe 
690 695 700 



agg gtg gag gag gtg aac tgg get tea 
Arg Val Glu Glu Val Asn Trp Ala Ser 
705 710 

ctg tgt gag gac ccg tea ggg gca ggt 

Leu Cys Glu Asp Pro Ser Gly Ala Gly 
725 

cct gtc ctg get tec cct ccc aag gag 

Pro Val Leu Ala Ser Pro Pro Lys Glu 

740 745 

gaa aac tat gtg ccc gtc cag etc etc 
Glu Asn Tyr Val Pro Val Gin Leu Leu 
755 760 



tgg gag cag acg ctg cct acg 2160 
Trp Glu Gin Thr Leu Pro Thr 
715 720 

gtc cct cga act etc gag aac 2208 
Val Pro Arg Thr Leu Glu Asn 
730 735 

gat gag gat ggt gee tct gag 2256 
Asp Glu Asp Gly Ala Ser Glu 
750 

cag tec aac 2292 
Gin Ser Asn 



<210> 7 
<211> 1489 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (3) . . (1310) 

<400> 7 



BNSDOCID: <WO 0Q29577A1JA> 



WO 00/29577 



PCT/US99/26701 



-23- 

gc ggc cgc ttc ttc cag aag ggc caa ggg act tgc ttt tat ttc ggt 47 
Gly Arg Phe Phe Gin Lys Gly Gin Gly Thr Cys Phe Tyr Phe Gly 
15 10 15 

gag eta ccc etc tct ttg gee get tgc acc aag cag tgg gat gtg gta 95 
Glu Leu Pro Leu Ser Leu Ala Ala Cys Thr Lys Gin Trp Asp Val Val 
20 25 30 

age tac etc ctg gag aac cca cac cag ccc gee age ctg cag gee act 143 
Ser Tyr Leu Leu Glu Asn Pro His Gin Pro Ala Ser Leu Gin Ala Thr 
35 40 45 

gac tec cag ggc aac aca gtc ctg cat gee eta gtg atg ate teg gac 191 
Asp Ser Gin Gly Asn Thr Val Leu His Ala Leu Val Met lie Ser Asp 
50 55 60 

aac tea get gag aac att gca ctg gtg acc age atg tat gat ggg etc 239 
Asn Ser Ala Glu Asn lie Ala Leu Val Thr Ser Met Tyr Asp Gly Leu 
65 70 75 

etc caa get ggg gee cgc etc tgc cct acc gtg cag ctt gag gac ate 287 
Leu Gin Ala Gly Ala Arg Leu Cys Pro Thr Val Gin Leu Glu Asp lie 
80 85 90 95 

cgc aac ctg cag gat etc acg cct ctg aag ctg gec gee aag gag ggc 335 
Arg Asn Leu Gin Asp Leu Thr Pro Leu Lys Leu Ala Ala Lys Glu Gly 
100 105 110 



aag ate gag att ttc agg cac ate ctg cag egg gag ttt tea gga ctg 383 
Lys He Glu He Phe Arg His He Leu Gin Arg Glu Phe Ser Gly Leu 
115 120 125 

age cac ctt tec cga aag ttc acc gag tgg tgc tat ggg cct gtc egg 431 
Ser His Leu Ser Arg Lys Phe Thr Glu Trp Cys Tyr Gly Pro Val Arg 
130 135 140 

gtg teg ctg tat gac ctg get tct gtg gac age tgt gag gag aac tea 479 
Val Ser Leu Tyr Asp Leu Ala Ser Val Asp Ser Cys Glu Glu Asn Ser 
145 150 155 

gtg ctg gag ate att gee ttt cat tgc- aag age ccg cac cga cac cga 527 
Val Leu Glu He He Ala Phe His Cys Lys Ser Pro His Arg His Arg 
160 165 170 175 

atg gtc gtt ttg gag ccc ctg aac aaa ctg ctg cag gcg aaa tgg gat 575 
Met Val Val Leu Glu Pro Leu Asn Lys Leu Leu Gin Ala Lys Trp Asp 
180 185 190 

ctg etc ate ccc aag ttc ttc tta aac ttc ctg tgt aat ctg ate tac 623 
Leu Leu He Pro Lys Phe Phe Leu Asn Phe Leu Cys Asn Leu lie Tyr 
195 200 205 

atg ttc ate ttc acc get gtt gee tac cat cag cct acc ctg aag aag 671 
Met Phe He Phe Thr Ala Val Ala Tyr His Gin Pro Thr Leu Lys Lys 
210 215 220 

cag gee gee cct cac ctg aaa gcg gag gtt gga aac tec atg ctg ctg 719 
Gin Ala Ala Pro His Leu Lys Ala Glu Val Gly Asn Ser Met Leu Leu 
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225 230 

acg ggc cac ate ctt ate ctg eta ggg 
Thr Gly His lie Leu lie Leu Leu Gly 
240 245 

cag ctg tgg tac ttc tgg egg cgc cac 
Gin Leu Trp Tyr Phe Trp Arg Arg His 
260 

ata gac age tac ttt gaa ate etc ttc 
lie Asp Ser Tyr Phe Glu lie Leu Phe 
275 280 

gtg gtg tec cag gtg ctg tgt ttc ctg 
Val Val Ser Gin Val Leu Cys Phe Leu 
290 295 

ctg ctt gtg tct gcg ctg gtg ctg ggc 
Leu Leu Val Ser Ala Leu Val Leu Gly 
305 310 

aca cgt ggc ttc cag cac aca ggc ate 
Thr Arg Gly Phe Gin His Thr Gly lie 
320 325 



- 24 - 

235 

ggg ate tac etc etc gtg ggc 767 
Gly lie Tyr Leu Leu Val Gly 
250 255 

gtg ttc ate tgg ate teg ttc 815 
Val Phe lie Trp lie Ser Phe 
265 270 

ctg ttc cag gec ctg etc aca 863 
Leu Phe Gin Ala Leu Leu Thr 
285 

gec ate gag tgg tac ctg ccc 911 
Ala lie Glu Trp Tyr Leu Pro 
300 

tgg ctg aac ctg ctt tac tat 959 
Trp Leu Asn Leu Leu Tyr Tyr 
315 

tac agt gtc atg ate cag aag 1007 
Tyr Ser Val Met lie Gin Lys 
330 335 



aaa gec ate tct gtc ctg gag atg gag aat ggc tat tgg tgg tgc agg 1055 
Lys Ala lie Ser Val Leu Glu Met Glu Asn Gly Tyr Trp Trp Cys Arg 
340 345 350 

aag aag cag egg gca ggt gtg atg ctg ace gtt ggc act aag cca gat 1103 
Lys Lys Gin Arg Ala Gly Val Met Leu Thr Val Gly Thr Lys Pro Asp 
355 360 365 

ggc age ccg gat gag cgc tgg tgc ttc agg gtg gag gag gtg aac tgg 1151 
Gly Ser Pro Asp Glu Arg Trp Cys Phe Arg Val Glu Glu Val Asn Trp 
370 375 380 

get tea tgg gag cag acg ctg cct acg ctg tgt gag gac ccg tea ggg 1199 
Ala Ser Trp Glu Gin Thr Leu Pro Thr Leu Cys Glu Asp Pro Ser Gly 
385 390 395 

gca ggt gtc cct cga act etc gag aac cct gtc ctg get tec cct ccc 1247 
Ala Gly Val Pro Arg Thr Leu Glu Asn Pro Val Leu Ala Ser Pro Pro 
400 405 410 415 

aag gag gat gag gat ggt gee tct gag gaa aac tat gtg ccc gtc cag 1295 
Lys Glu Asp Glu Asp Gly Ala Ser Glu Glu Asn Tyr Val Pro Val Gin 
420 425 430 

etc etc cag tec aac tgatg-gecca gatgeagcag gaggecagag gacagagcag 1350 
Leu Leu Gin Ser Asn 
435 

aggatctttc caaccacatc tgctggctct ggggtcccag tgaattctgg tggcaaatat 1410 
atattttcac taactcaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaagg 1470 
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agcggacgcg tgggtcgac 1489 



<210> 8 
<211> 436 
<212> PRT 

<213> Homo sapiens 
<400> 8 

Gly Arg Phe Phe Gin Lys Gly Gin Gly Thr Cys Phe Tyr Phe Gly Glu 
15 10 15 

Leu Pro Leu Ser Leu Ala Ala Cys Thr Lys Gin Trp Asp Val Val Ser 
20 25 30 

Tyr Leu Leu Glu Asn Pro His Gin Pro Ala Ser Leu Gin Ala Thr Asp 
35 40 45 

Ser Gin Gly Asn Thr Val Leu His Ala Leu Val Met lie Ser Asp Asn 
50 55 60 

Ser Ala Glu Asn lie Ala Leu Val Thr Ser Met Tyr Asp Gly Leu Leu 
65 70 75 80 



Gin Ala Gly Ala 



Asn Leu Gin Asp 
100 

He Glu He Phe 
115 

His Leu Ser Arg 
130 

Ser Leu Tyr Asp 
145 

Leu Glu He He 



Val Val Leu Glu 
180 

Leu lie Pro Lys 
195 

Phe He Phe Thr 
210 

Ala Ala Pro His 
225 

Gly His lie Leu 



Leu Trp Tyr Phe 



Arg Leu Cys Pro 
85 

Leu Thr Pro Leu 



Arg His lie Leu 
120 

Lys Phe Thr Glu 
135 

Leu Ala Ser Val 
150 

Ala Phe His Cys 
165 

Pro Leu Asn Lys 



Phe Phe Leu Asn 
200 

Ala Val Ala Tyr 
215 

Leu Lys Ala Glu 
230 

lie Leu Leu Gly 
245 

Trp Arg Arg His 



Thr Val Gin Leu 
90 

Lys Leu Ala Ala 
105 

Gin Arg Glu Phe 



Trp Cys Tyr Gly 
140 

Asp Ser Cys Glu 
155 

Lys Ser Pro His 
170 

Leu Leu Gin Ala 
185 

Phe Leu Cys Asn 



His Gin Pro Thr 
220 

Val Gly Asn Ser 
235 

Gly lie Tyr Leu 
250 

Val Phe He Trp 



Glu Asp He Arg 
95 

Lys Glu Gly Lys 
110 

Ser Gly Leu Ser 
125 

Pro Val Arg Val 



Glu Asn Ser Val 
160 

Arg His Arg Met 
175 

Lys Trp Asp Leu 
190 

Leu lie Tyr Met 
205 

Leu Lys Lys Gin 



Met Leu Leu Thr 
240 

Leu Val Gly Gin 
255 

He Ser Phe lie 
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260 

Asp Ser Tyr Phe 
275 

Val Ser Gin Val 
290 

Leu Val Ser Ala 
305 

Arg Gly Phe Gin 



Ala lie Ser Val 
340 

Lys Gin Arg Ala 
355 

Ser Pro Asp Glu 
370 

Ser Trp Glu Gin 
385 

Gly Val Pro Arg 



Glu Asp Glu Asp 
420 

Leu Gin Ser Asn 
435 



Glu lie Leu Phe 
280 

Leu Cys Phe Leu 
295 

Leu Val Leu Gly 
310 

His Thr Gly He 
325 

Leu Glu Met Glu 



Gly Val Met Leu 
360 

Arg Trp Cys Phe 
375 

Thr Leu Pro Thr 
390 

Thr Leu Glu Asn 
405 

Gly Ala Ser Glu 



-26- 



265 

Leu Phe Gin Ala 



Ala He Glu Trp 
300 

Trp Leu Asn Leu 
315 

Tyr Ser Val Met 
330 

Asn Gly Tyr Trp 
345 

Thr Val Gly Thr 



Arg Val Glu Glu 
380 

Leu Cys Glu Asp 
395 

Pro Val Leu Ala 
410 

Glu Asn Tyr Val 
425 



Leu Leu Thr Val 
285 

Tyr Leu Pro Leu 



Leu Tyr Tyr Thr 
320 

He Gin Lys Lys 
335 

Trp Cys Arg Lys 
350 

Lys Pro Asp Gly 
365 

Val Asn Trp Ala 



Pro Ser Gly Ala 
400 

Ser Pro Pro Lys 
415 

Pro Val Gin Leu 
430 



<210> 9 
<211> 1308 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . (1308) 

<400> 9 

ggc cgc ttc ttc cag aag ggc caa ggg act tgc ttt tat ttc ggt gag 48 

Gly Arg Phe Phe Gin Lys Gly Gin Gly Thr Cys Phe Tyr Phe Gly Glu 
15 10 15 



eta ccc etc tct ttg gee get tgc acc aag cag tgg gat gtg gta age 96 
Leu Pro Leu Ser Leu Ala Ala Cys Thr Lys Gin Trp Asp Val Val Ser 
20 25 30 

tac etc ctg gag aac cca cac cag ccc gec age ctg cag gec act gac 144 
Tyr Leu Leu Glu Asn Pro His Gin Pro Ala Ser Leu Gin Ala Thr Asp 
35 40 45 
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tec cag ggc aac aca gtc ctg cat gec eta gtg atg ate teg gac aac 192 
Ser Gin Gly Asn Thr Val Leu His Ala Leu Val Met lie Ser Asp Asn 
50 55 60 

tea get gag aac att gca ctg gtg acc age atg tat gat ggg etc etc 240 
Ser Ala Glu Asn lie Ala Leu Val Thr Ser Met Tyr Asp Gly Leu Leu 
65 70 75 80 

caa get ggg gec cgc etc tgc cct acc gtg cag ctt gag gac ate cgc 288 
Gin Ala Gly Ala Arg Leu Cys Pro Thr Val Gin Leu Glu Asp lie Arg 
85 90 95 

aac ctg cag gat etc acg cct ctg aag ctg gee gee aag gag ggc aag 336 
Asn Leu Gin Asp Leu Thr Pro Leu Lys Leu Ala Ala Lys Glu Gly Lys 
100 105 110 

ate gag att ttc agg cac ate ctg cag egg gag ttt tea gga ctg age 384 
lie Glu lie Phe Arg His lie Leu Gin Arg Glu Phe Ser Gly Leu Ser 
115 120 125 

cac ctt tec cga aag ttc acc gag tgg tgc tat ggg cct gtc egg gtg 432 
His Leu Ser Arg Lys Phe Thr Glu Trp Cys Tyr Gly Pro Val Arg Val 
130 135 140 

teg ctg tat gac ctg get tct gtg gac age tgt gag gag aac tea gtg 480 
Ser Leu Tyr Asp Leu Ala Ser Val Asp Ser Cys Glu Glu Asn Ser Val 
145 150 155 160 

ctg gag ate att gec ttt cat tgc aag age ccg cac cga cac cga atg 528 
Leu Glu lie lie Ala Phe His Cys Lys Ser Pro His Arg His Arg Met 
165 170 175 

gtc gtt ttg gag ccc ctg aac aaa ctg ctg cag gcg aaa tgg gat ctg 576 
Val Val Leu Glu Pro Leu Asn Lys Leu Leu Gin Ala Lys Trp Asp Leu 
180 185 190 

etc ate ccc aag ttc ttc tta aac ttc ctg tgt aat ctg ate tac atg 624 
Leu lie Pro Lys Phe Phe Leu Asn Phe Leu Cys Asn Leu lie Tyr Met 
195 200 205 

ttc ate ttc acc get gtt gee tac cat cag cct acc ctg aag aag cag 672 
Phe lie Phe Thr Ala Val Ala Tyr His Gin Pro Thr Leu Lys Lys Gin 
210 215 220 

gee gec cct cac ctg aaa gcg gag gtt gga aac tec atg ctg ctg acg 720 
Ala Ala Pro His Leu Lys Ala Glu Val Gly Asn Ser Met Leu Leu Thr 
225 230 235 240 

ggc cac ate ctt ate ctg eta ggg ggg ate tac etc etc gtg ggc cag 768 
Gly His lie Leu lie Leu Leu Gly Gly lie Tyr Leu Leu Val Gly Gin 
245 250 255 



ctg tgg tac ttc tgg egg cgc cac 
Leu Trp Tyr Phe Trp Arg Arg His 
260 

gac age tac ttt gaa ate etc ttc 
Asp Ser Tyr Phe Glu lie Leu Phe 



gtg ttc ate tgg ate teg ttc ata 816 
Val Phe lie Trp lie Ser Phe lie 
265 270 

ctg ttc cag gee ctg etc aca gtg 864 
Leu Phe Gin Ala Leu Leu Thr Val 
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275 280 285 

gtg tec cag gtg ctg tgt ttc ctg gec ate gag tgg tac ctg ccc ctg 912 
Val Ser Gin Val Leu Cys Phe Leu Ala lie Glu Trp Tyr Leu Pro Leu 
290 295 300 

ctt gtg tct gcg ctg gtg ctg ggc tgg ctg aac ctg ctt tac tat aca 960 
Leu Val Ser Ala Leu Val Leu Gly Trp Leu Asn Leu Leu Tyr Tyr Thr 
305 310 315 320 

cgt ggc ttc cag cac aca ggc ate tac agt gtc atg ate cag aag aaa 1008 
Arg Gly Phe Gin His Thr Gly lie Tyr Ser Val Met lie Gin Lys Lys 
325 330 335 

gec ate tct gtc ctg gag atg gag aat ggc tat tgg tgg tgc agg aag 1056 
Ala lie Ser Val Leu Glu Met Glu Asn Gly Tyr Trp Trp Cys Arg Lys 
340 345 350 

aag cag egg gca ggt gtg atg ctg acc gtt ggc act aag cca gat ggc 1104 
Lys Gin Arg Ala Gly Val Met Leu Thr Val Gly Thr Lys Pro Asp Gly 
355 360 365 

age ccg gat gag cgc tgg tgc ttc agg gtg gag gag gtg aac tgg get 1152 
Ser Pro Asp Glu Arg Trp Cys Phe Arg Val Glu Glu Val Asn Trp Ala 
370 375 380 

tea tgg gag cag acg ctg cct acg ctg tgt gag gac ccg tea ggg gca 1200 
Ser Trp Glu Gin Thr Leu Pro Thr Leu Cys Glu Asp Pro Ser Gly Ala 
385 390 395 400 

ggt gtc cct cga act etc gag aac cct gtc ctg get tec cct ccc aag 1248 
Gly Val Pro Arg Thr Leu Glu Asn Pro Val Leu Ala Ser Pro Pro Lys 
405 410 415 

gag gat gag gat ggt gee tct gag gaa aac tat gtg ccc gtc cag etc 1296 
Glu Asp Glu Asp Gly Ala Ser Glu Glu Asn Tyr Val Pro Val Gin Leu 
420 425 430 

etc cag tec aac 1308 
Leu Gin Ser Asn 
435 



<210> 10 
<211> 1794 
<212> DNA 
<213> Rattus sp. 
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<220> 

<221> CDS 

<222> (2) . . (1663) 

<400> 10 

g teg acc cac gcg tec get ctt tct ctg get gcg tgc acc aag cag tgg 49 
Ser Thr His Ala Ser Ala Leu Ser Leu Ala Ala Cys Thr Lys Gin Trp 
15 10 15 

gat gtg gtg acc tac etc ctg gag aac cca cac cag ccg gee age ctg 97 
Asp Val Val Thr Tyr Leu Leu Glu Asn Pro His Gin Pro Ala Ser Leu 
20 25 30 

gag gec acc gac tec ctg ggc aac aca gtc ctg cat get ctg gta atg 145 
Glu Ala Thr Asp Ser Leu Gly Asn Thr Val Leu His Ala Leu Val Met 
35 40 45 

att nca gat aac teg cct gag aac agt gec ctg gtg ate cac atg tac 193 
Tie Aia Asp Asn Ser Pro Glu Asn Ser Ala Leu Val He His Met Tyr 
50 55 60 

gac gqg ctt eta caa atg ggg gcg cgc etc tgc ccc act gtg cag ctt 241 
Asp Giy Leu Leu Gin Met Gly Ala Arg Leu Cys Pro Thr Val Gin Leu 
6b 70 75 80 

gag gaa ate tec aac cac caa ggc etc aca ccc ctg aaa eta gec gec 289 
Glu Glu He Ser Asn His Gin Gly Leu Thr Pro Leu Lys Leu Ala Ala 
85 90 95 

aag gaa ggc aaa ate gag att ttc agg cac att ctg cag egg gaa ttc 337 
Lys Glu Gly Lys He Glu He Phe Arg His lie Leu Gin Arg Glu Phe 
100 105 110 

tea gga ccg tac cag ccc ctt tec cga aag ttt act gag tgg tgt tac 385 
Ser Gly Pro Tyr Gin Pro Leu Ser Arg Lys Phe Thr Glu Trp Cys Tyr 
115 120 125 

ggt cct gtg egg gta teg ctg tac gac ctg tec tct gtg gac age tgg 433 
Gly Pro Val Arg Val Ser Leu Tyr Asp Leu Ser Ser Val Asp Ser Trp 
130 135 140 

gaa aag aac teg gtg ctg gag ate ate get ttt cat tgc aag age ccg 481 
Glu Lys Asn Ser Val Leu Glu He He Ala Phe His Cys Lys Ser Pro 
145 150 155 160 

aac egg cac cgc atg gtg gtt tta gaa cca ctg aac aag ctt ctg cag 529 
Asn Arg His Arg Met Val Val Leu Glu Pro Leu Asn Lys Leu Leu Gin 
165 170 175 

gag aaa tgg gat egg etc gtc tea aga ttc ttc ttc aac ttc gec tgc 577 
Glu Lys Trp Asp Arg Leu Val Ser Arg Phe Phe Phe Asn Phe Ala Cys 
180 185 190 

tac ttg gtc tac atg ttc ate ttc acc gtc gtt gee tac cac cag cct 625 
Tyr Leu Val Tyr Met Phe He Phe Thr Val Val Ala Tyr His Gin Pro 
195 200 205 

tec ctg gat cag cca gee ate ccc tea tea aaa gcg act ttt ggg gaa 673 
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Ser Leu Asp Gin Pro Ala lie Pro Ser Ser Lys Ala Thr Phe Gly Glu 
210 215 220 

tec atg ctg ctg ctg ggc cac att ctg ate ctg ctt ggg ggt att tac 721 
Ser Met Leu Leu Leu Gly His lie Leu lie Leu Leu Gly Gly lie Tyr 
225 230 235 240 

etc tta ctg ggc cag ctg tgg tac ttt tgg egg egg cgc ctg ttt ate 769 
Leu Leu Leu Gly Gin Leu Trp Tyr Phe Trp Arg Arg Arg Leu Phe lie 
245 250 255 

tgg ate tea ttc atg gac age tac ttt gaa ate etc ttt etc ctt cag 817 
Trp lie Ser Phe Met Asp Ser Tyr Phe Glu lie Leu Phe Leu Leu Gin 
260 265 270 

get ctg etc aca gtg ctg tec cag gtg ctg cgc ttc atg gag act gaa 865 
Ala Leu Leu Thr Val Leu Ser Gin Val Leu Arg Phe Met Glu Thr Glu 
275 280 285 

tgg tac eta ccc ctg eta gtg tta tec eta gtg ctg ggc tgg ctg aac 913 
Trp Tyr Leu Pro Leu Leu Val Leu Ser Leu Val Leu Gly Trp Leu Asn 
290 295 300 

ctg ctt tac tac aca egg ggc ttt cag cac aca ggc ate tac agt gtc 961 
Leu Leu Tyr Tyr Thr Arg Gly Phe Gin His Thr Gly He Tyr Ser Val 
305 310 315 320 

atg ate cag aag gtc ate ctt cga gac ctg etc cgt ttc ctg ctg gtc 1009 
Met He Gin Lys Val He Leu Arg Asp Leu Leu Arg Phe Leu Leu Val 
325 330 335 

tac ctg gtc ttc ctt ttc ggc ttt get gta gee eta gta age ttg age 1057 
Tyr Leu Val Phe Leu Phe Gly Phe Ala Val Ala Leu Val Ser Leu Ser 
340 345 350 

aga gag gee cga agt ccc aaa gee cct gaa gat aac aac tec aca gtg 1105 
Arg Glu Ala Arg Ser Pro Lys Ala Pro Glu Asp Asn Asn Ser Thr Val 
355 360 365 

acg gaa cag ccc acg gtg ggc cag gag gag gag cca get cca tat egg 1153 
Thr Glu Gin Pro Thr Val Gly Gin Glu Glu Glu Pro Ala Pro Tyr Arg 
370 375 380 

age att ctg gat gee tec eta gag ctg ttc aag ttc ace att ggt atg 1201 
Ser He Leu Asp Ala Ser Leu Glu Leu Phe Lys Phe Thr lie Gly Met 
385 390 395 400 

ggg gag ctg get ttc cag gaa cag ctg cgt ttt cgt ggg gtg gtc ctg 124 9 
Gly Glu Leu Ala Phe Gin Glu Gin Leu Arg Phe Arg Gly Val Val Leu 
405 410 415 

ctg ttg ctg ttg gee tac gtc ctt etc ace tac gtc ctg ctg etc aac 1297 
Leu Leu Leu Leu Ala Tyr Val Leu Leu Thr Tyr Val Leu Leu Leu Asn 
420 425 430 

atg etc att get etc atg age gaa act gtc aac cac gtt get gac aac 1345 
Met Leu He Ala Leu Met Ser Glu Thr Val Asn His Val Ala Asp Asn 
435 440 445 
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age tgg age ate tgg aag ttg cag aaa gec ate tct gtc ttg gag atg 1393 
Ser Trp Ser lie Trp Lys Leu Gin Lys Ala lie Ser Val Leu Glu Met 
450 455 460 

gag aat ggt tac tgg tgg tgc egg agg aag aaa cat cgt gaa ggg agg 1441 
Glu Asn Gly Tyr Trp Trp Cys Arg Arg Lys Lys His Arg Glu Gly Arg 
465 470 475 480 

ctg ctg aaa gtc ggc acc agg ggg gat ggt ace cct gat gag cgc tgg 14 89 
Leu Leu Lys Val Gly Thr Arg Gly Asp Gly Thr Pro Asp Glu Arg Trp 
485 490 495 

tgc ttc agg gtg gag gaa gta aat tgg get get tgg gag aag act ctt 1537 
Cys Phe Arg Val Glu Glu Val Asn Trp Ala Ala Trp Glu Lys Thr Leu 
500 505 510 

ccc acc tta tct gag gat cca tea ggg cca ggc ate act ggt aat aaa 1585 
Pro Thr Leu Ser Glu Asp Pro Ser Gly Pro Gly lie Thr Gly Asn Lys 
515 520 525 



aag aac cca acc tct aaa ccg ggg aag aac agt gec tea gag gaa gac 1633 
Lys Asn Pro Thr Ser Lys Pro Gly Lys Asn Ser Ala Ser Glu Glu Asp 
530 535 540 

cat ctg ccc ctt cag gtc etc cag tec ccc tgatggccca gatgeagcag 1683 
His Leu Pro Leu Gin Val Leu Gin Ser Pro 
545 550 

caggctggca ggatggagta gggaatcttc ccagccacac cagaggctac tgaattttgg 1743 
tggaaatata aatatttttt ttgcataaaa aaaaaaaaaa agggeggecg c 1794 

<210> 11 

<211> 554 

<212> PRT 

<213> Rattus sp. 

<400> 11 

Ser Thr His Ala Ser Ala Leu Ser Leu Ala Ala Cys Thr Lys Gin Trp 
15 10 15 

Asp Val Val Thr Tyr Leu Leu Glu Asn Pro His Gin Pro Ala Ser Leu 
20 25 30 

Glu Ala Thr Asp Ser Leu Gly Asn Thr Val Leu His Ala Leu Val Met 
35 40 45 

lie Ala Asp Asn Ser Pro Glu Asn Ser Ala Leu Val lie His Met Tyr 
50 55 60 

Asp Gly Leu Leu Gin Met Gly Ala Arg Leu Cys Pro Thr Val Gin Leu 
65 70 75 80 

Glu Glu lie Ser Asn His Gin Gly Leu Thr Pro Leu Lys Leu Ala Ala 
85 90 95 
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Lys Glu Gly Lys 
100 

Ser Gly Pro Tyr 
115 

Gly Pro Val Arg 
130 

Glu Lys Asn Ser 
145 

Asn Arg His Arg 



Glu Lys Trp Asp 
180 

Tyr Leu Val Tyr 
1 95 

Ser Leu Asp Gin 
210 

Ser Met Leu Leu 
225 

Leu Leu Leu Gly 



Trp lie Ser Phe 
260 

Ala Leu Leu Thr 
275 

Trp Tyr Leu Pro 
290 

Leu Leu Tyr Tyr 
305 

Met lie Gin Lys 



Tyr Leu Val Phe 
340 

Arg Glu Ala Arg 

355 

Thr Glu Gin Pro 
370 

Ser lie Leu Asp 
385 

Gly Glu Leu Ala 



He Glu He Phe 



Gin Pro Leu Ser 
120 

Val Ser Leu Tyr 
135 

Val Leu Glu He 
150 

Met Val Val Leu 
165 

Arg Leu Val Ser 



Met Phe He Phe 
200 

Pro Ala lie Pro 
215 

Leu Gly His He 
230 

Gin Leu Trp Tyr 
245 

Met Asp Ser Tyr 



Val Leu Ser Gin 
280 

Leu Leu Val Leu 
295 

Thr Arg Gly Phe 
310 

Val He Leu Arg 
325 

Leu Phe Gly Phe 



Ser Pro Lys Ala 
360 

Thr Val Gly Gin 
375 

Ala Ser Leu Glu 
390 

Phe Gin Glu Gin 
405 



Arg His He Leu 
105 

Arg Lys Phe Thr 



Asp Leu Ser Ser 
140 

He Ala Phe His 
155 

Glu Pro Leu Asn 
170 

Arg Phe Phe Phe 
185 

Thr Val Val Ala 



Ser Ser Lys Ala 
220 

Leu He Leu Leu 
235 

Phe Trp Arg Arg 
250 

Phe Glu He Leu 
265 

Val Leu Arg Phe 



Ser Leu Val Leu 
300 

Gin His Thr Gly 
315 

Asp Leu Leu Arg 
330 

Ala Val Ala Leu 
345 

Pro Glu Asp Asn 



Glu Glu Glu Pro 
380 

Leu Phe Lys Phe 
395 

Leu Arg Phe Arg 
410 



Gin Arg Glu Phe 
110 

Glu Trp Cys Tyr 
125 

Val Asp Ser Trp 



Cys Lys Ser Pro 
160 

Lys Leu Leu Gin 
175 

Asn Phe Ala Cys 
190 

Tyr His Gin Pro 
205 

Thr Phe Gly Glu 



Gly Gly He Tyr 
240 

Arg Leu Phe He 
255 

Phe Leu Leu Gin 
270 

Met Glu Thr Glu 
285 

Gly Trp Leu Asn 



lie Tyr Ser Val 
320 

Phe Leu Leu Val 
335 

Val Ser Leu Ser 
350 

Asn Ser Thr Val 
365 

Ala Pro Tyr Arg 



Thr He Gly Met 
400 

Gly Val Val Leu 
415 
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Leu Leu Leu Leu 
420 

Met Leu lie Ala 
435 

Ser Trp Ser lie 
450 

Glu Asn Gly Tyr 
465 

Leu Leu Lys Val 



Cys Phe Arg Val 
500 

Pro Thr Leu Ser 

515 

Lys Asn Pro Thr 
530 

His Leu Pro Leu 
545 



Ala Tyr Val Leu 



Leu Met Ser Glu 
440 

Trp Lys Leu Gin 
455 

Trp Trp Cys Arg 
470 

Gly Thr Arg Gly 
485 

Glu Glu Val Asn 



Glu Asp Pro Ser 
520 

Ser Lys Pro Gly 
535 

Gin Val Leu Gin 
550 



Leu Thr Tyr Val 
425 

Thr Val Asn His 



Lys Ala lie Ser 
460 

Arg Lys Lys His 
475 

Asp Gly Thr Pro 
4 90 

Trp Ala Ala Trp 
505 

Gly Pro Gly lie 



Lys Asn Ser Ala 
540 

Ser Pro 



Leu Leu Leu Asn 
430 

Val Ala Asp Asn 
445 

Val Leu Glu Met 



Arg Glu Gly Arg 
480 

Asp Glu Arg Trp 
495 

Glu Lys Thr Leu 
510 

Thr Gly Asn Lys 
525 

Ser Glu Glu Asp 



<210> 12 
<211> 1662 
<212> DNA 
<213> Rattus sp. 

<220> 

<221> CDS 

<222> (1) . . (1662) 

<400> 12 

teg acc cac gcg tec get ctt tct ctg get gcg tgc acc aag cag tgg 48 

Ser Thr His Ala Ser Ala Leu Ser Leu Ala Ala Cys Thr Lys Gin Trp 

15 10 15 

gat gtg gtg acc tac etc ctg gag aac cca cac cag ccg gee age ctg 96 
Asp Val Val Thr Tyr Leu Leu Glu Asn Pro His Gin Pro Ala Ser Leu 
20 25 30 

gag gec acc gac tec ctg ggc aac aca gtc ctg cat get ctg gta atg 144 
Glu Ala Thr Asp Ser Leu Gly Asn Thr Val Leu His Ala Leu Val Met 
35 40 45 

att gca gat aac teg cct gag aac agt gec ctg gtg ate cac atg tac 192 
lie Ala Asp Asn Ser Pro Glu Asn Ser Ala Leu Val lie His Met Tyr 
50 55 60 

gac ggg ctt eta caa atg ggg gcg cgc etc tgc ccc act gtg cag ctt 240 
Asp Gly Leu Leu Gin Met Gly Ala Arg Leu Cys Pro Thr Val Gin Leu 
65 70 75 80 
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gag gaa ate tec aac cac caa ggc etc aca ccc ctg aaa eta gee gee 288 
Glu Glu lie Ser Asn His Gin Gly Leu Thr Pro Leu Lys Leu Ala Ala 

85 90 95 

aag gaa ggc aaa ate gag att ttc agg cac att ctg cag egg gaa ttc 336 
Lys Glu Gly Lys lie Glu lie Phe Arg His lie Leu Gin Arg Glu Phe 
100 105 110 

tea gga ccg tac cag ccc ctt tec cga aag ttt act gag tgg tgt tac 384 
Ser Gly Pro Tyr Gin Pro Leu Ser Arg Lys Phe Thr Glu Trp Cys Tyr 
115 120 125 

ggt cct gtg egg gta teg ctg tac gac ctg tec tct gtg gac age tgg 432 
Gly Pro Val Arg Val Ser Leu Tyr Asp Leu Ser Ser Val Asp Ser Trp 
130 135 140 

gaa aag aac teg gtg ctg gag ate ate get ttt cat tgc aag age ccg 480 
Glu Lys Asn Ser Val Leu Glu lie lie Ala Phe His Cys Lys Ser Pro 
145 150 155 160 

aac egg cac cgc atg gtg gtt tta gaa cca ctg aac aag ctt ctg cag 528 
Asn Arg His Arg Met Val Val Leu Glu Pro Leu Asn Lys Leu Leu Gin 
165 170 175 

gag aaa tgg gat egg etc gtc tea aga ttc ttc ttc aac ttc gee tgc 576 
Glu Lys Trp Asp Arg Leu Val Ser Arg Phe Phe Phe Asn Phe Ala Cys 
180 185 190 

tac ttg gtc tac atg ttc ate ttc ace gtc gtt gee tac cac cag cct 624 
Tyr Leu Val Tyr Met Phe lie Phe Thr Val Val Ala Tyr His Gin Pro 
195 200 205 

tec ctg gat cag cca gee ate ccc tea tea aaa gcg act ttt ggg gaa 672 
Ser Leu Asp Gin Pro Ala lie Pro Ser Ser Lys Ala Thr Phe Gly Glu 
210 215 220 

tec atg ctg ctg ctg ggc cac att ctg ate ctg ctt ggg ggt att tac 720 
Ser Met Leu Leu Leu Gly His lie Leu lie Leu Leu Gly Gly lie Tyr 
225 230 235 240 

etc tta ctg ggc cag ctg tgg tac ttt tgg egg egg cgc ctg ttt ate 768 
Leu Leu Leu Gly Gin Leu Trp Tyr Phe Trp Arg Arg Arg Leu Phe lie 
245 250 255 

tgg ate tea ttc atg gac age tac ttt gaa ate etc ttt etc ctt cag 816 
Trp lie Ser Phe Met Asp Ser Tyr Phe Glu lie Leu Phe Leu Leu Gin 
260 265 270 

get ctg etc aca gtg ctg tec cag gtg ctg cgc ttc atg gag act gaa 864 
Ala Leu Leu Thr Val Leu Ser Gin Val Leu Arg Phe Met Glu Thr Glu 
275 280 285 

tgg tac eta ccc ctg eta gtg tta tec eta gtg ctg ggc tgg ctg aac 912 
Trp Tyr Leu Pro Leu Leu Val Leu Ser Leu Val Leu Gly Trp Leu Asn 
290 295 300 

ctg ctt tac tac aca egg ggc ttt cag cac aca ggc ate tac agt gtc 960 
Leu Leu Tyr Tyr Thr Arg Gly Phe 1 Gin His Thr Gly lie Tyr Ser Val 
305 310 315 320 
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atg ate cag aag gtc ate ctt cga gac ctg etc cgt ttc ctg ctg gtc 
Met lie Gin Lys Val lie Leu Arg Asp Leu Leu Arg Phe Leu Leu Val 
325 330 335 



1008 



tac ctg gtc ttc ctt ttc ggc ttt get gta gec eta gta age ttg age 
Tyr Leu Val Phe Leu Phe Gly Phe Ala Val Ala Leu Val Ser Leu Ser 
340 345 350 



1056 



aga gag gec cga agt ccc aaa gec cct gaa gat aac aac tec aca gtg 
Arg Glu Ala Arg Ser Pro Lys Ala Pro Glu Asp Asn Asn Ser Thr Val 
355 360 365 



1104 



acg gaa cag ccc acg gtg ggc cag gag gag gag cca get cca tat egg 
Thr Glu Gin Pro Thr Val Gly Gin Glu Glu Glu Pro Ala Pro Tyr Arg 
370 375 380 



1152 



age att ctg gat gee tec eta gag ctg ttc aag ttc ace att ggt atg 
Ser lie Leu Asp Ala Ser Leu Glu Leu Phe Lys Phe Thr lie Gly Met 
385 390 395 400 



1200 



ggg gag ctg get ttc cag gaa cag ctg cgt ttt cgt ggg gtg gtc ctg 
Gly Glu Leu Ala Phe Gin Glu Gin Leu Arg Phe Arg Gly Val Val Leu 
405 410 415 



1248 



ctg ttg ctg ttg gec tac gtc ctt etc acc tac gtc ctg ctg etc aac 
Leu Leu Leu Leu Ala Tyr Val Leu Leu Thr Tyr Val Leu Leu Leu Asn 
420 425 430 



1296 



atg etc att get etc atg age gaa act gtc aac cac gtt get gac aac 
Met Leu lie Ala Leu Met Ser Glu Thr Val Asn His Val Ala Asp Asn 
435 440 445 



1344 



age tgg age ate tgg aag ttg cag aaa gee ate tct gtc ttg gag atg 
Ser Trp Ser lie Trp Lys Leu Gin Lys Ala lie Ser Val Leu Glu Met 
450 455 460 



1392 



gag aat ggt tac tgg tgg tgc egg agg aag aaa cat cgt gaa ggg agg 
Glu Asn Gly Tyr Trp Trp Cys Arg Arg Lys Lys His Arg Glu Gly Arg 
465 470 475 480 



1440 



ctg ctg aaa gtc ggc acc agg ggg gat ggt acc cct gat gag cgc tgg 
Leu Leu Lys Val Gly Thr Arg Gly Asp Gly Thr Pro Asp Glu Arg Trp 
485 490 495 



1488 



tgc ttc agg gtg gag gaa gta aat tgg get get tgg gag aag act ctt 
Cys Phe Arg Val Glu Glu Val Asn Trp Ala Ala Trp Glu Lys Thr Leu 
500 505 510 



1536 



ccc acc tta tct gag gat cca tea ggg cca ggc ate act ggt aat aaa 
Pro Thr Leu Ser Glu Asp Pro Ser Gly Pro Gly lie Thr Gly Asn Lys 
515 520 525 



1584 



aag aac cca acc tct aaa ccg ggg aag aac agt gec tea gag gaa gac 
Lys Asn Pro Thr Ser Lys Pro Gly Lys Asn Ser Ala Ser Glu Glu Asp 
530 535 540 



1632 



cat ctg ccc ctt cag gtc etc cag. tec ccc 
His Leu Pro Leu Gin Val Leu Gin Ser Pro 



1662 
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545 550 



<210> 13 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
peptide 

<400> 13 

Ala Phe His Cys Lys Ser Pro His Arg His Arg Met Val Val Leu Glu 
15 10 15 



<210> 14 
<211> 25 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic > 
peptide 

<400> 14 

Arg Pro Glu Ala Pro Thr Gly Pro Asn Ala Thr Glu Ser Val Gin Pro % 
1 5 10 15 5 

Met Glu Gly Gin Glu Asp Glu Gly Asn 
20 25 

<210> 15 
<211> 19 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
peptide 

<400> 15 

Ser Val Leu Glu Met Glu Asn Gly Tyr Trp Trp Cys Arg Lys Lys Gin 
15 10 15 

Arg Ala Gly 



<210> 16 
<211> 20 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 16 

taggagaccc cgttgccacg 20 



WO 00/29577 



<210> 17 
<211> 22 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : primer 
<400> 17 

gattcacttg gggacagtga eg 22 



<210> 18 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 18 

ttaagctccc gttctccacc g 21 



<210> 19 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 

<400> 19 

getgegggag gaagtgaagc 20 



<210> 20 
<211> 630 
<212> PRT 

<213> Homo sapiens 
<400> 20 

Met Thr Ser Pro Ser Ser Ser Pro Val Phe Arg Leu Glu Thr Leu Asp 
15 10 15 

Gly Gly Gin Glu Asp Gly Ser Glu Ala Asp Arg Gly Lys Leu Asp Phe 
20 25 30 
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Gly Ser Gly Leu Pro Pro Met Glu 
35 40 

Lys Phe Ala Pro Gin lie Arg Val 
50 55 

Gly Ala Ser Gin Pro Asp Pro Asn 
65 70 

Asn Ala Val Ser Arg Gly Val Pro 

85 



Ser Gin Phe Gin Gly Glu Asp Arg 
45 

Asn Leu Asn Tyr Arg Lys Gly Thr 
60 

Arg Phe Asp Arg Asp Arg Leu Phe 
75 80 

Glu Asp Leu Ala Gly Leu Pro Glu 
90 95 



Tyr Leu Ser Lys Thr Ser Lys Tyr Leu Thr Asp Ser Glu Tyr Thr Glu 
100 105 110 

Gly Ser Thr Gly Lys Thr Cys Leu Met Lys Ala Val Leu Asn Leu Lys 
115 120. 125 

Asp Gly Val Asn Ala Cys lie Leu Pro Leu Leu Gin lie Asp Arg Asp 
130 135 140 

Ser Gly Asn Pro Gin Pro Leu Val Asn Ala Gin Cys Thr Asp Asp Tyr 
145 150 155 160 

Tyr Arg Gly His Ser Ala Leu His lie Ala lie Glu Lys Arg Ser Leu 
165 170 175 

Gin Cys Val Lys Leu Leu Val Glu Asn Gly Ala Asn Val His Ala Arg 
180 185 190 

Ala Cys Gly Arg Phe Phe Gin Lys Gly Gin Gly Thr Cys Phe Tyr Phe 
195 200 205 

Gly Glu Leu Pro Leu Ser Leu Ala Ala Cys Thr Lys Gin Trp Asp Val 
210 215 220 

Val Ser Tyr Leu Leu Glu Asn Pro His Gin Pro Ala Ser Leu Gin Ala. 
225 230 235 240 

Thr Asp Ser Gin Gly Asn Thr Val Leu His Ala Leu Val Met lie Ser 
245 250 255 

Asp Asn Ser Ala Glu Asn lie Ala Leu Val Thr Ser Met Tyr Asp Gly 
260 265 270 

Leu Leu Gin Ala Gly Ala Arg Leu Cys Pro Thr Val Gin Leu Glu Asp 
275 280 285 

lie Arg Asn Leu Gin Asp Leu Thr Pro Leu Lys Leu Ala Ala Lys Glu 
290 295 300 

Gly Lys lie Glu lie Phe Arg His lie Leu Gin Arg Glu Phe Ser Gly 
305 310 315 320 

Leu Ser His Leu Ser Arg Lys Phe Thr Glu Trp Cys Tyr Gly Pro Val 
325 330 335 
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Arg Val Ser Leu Tyr Asp Leu Ala Ser Val Asp Ser Cys Glu Glu Asn 
340 345 350 

Ser Val Leu Glu lie lie Ala Phe His Cys Lys Ser Pro His Arg His 
355 360 365 

Arg Met Val Val Leu Glu Pro Leu Asn Lys Leu Leu Gin Ala Lys Trp 
370 375 380 

Asp Leu Leu lie Pro Lys Phe Phe Leu Asn Phe Leu Cys Asn Leu lie 
385 390 395 400 

Tyr Met Phe lie Phe Thr Ala Val Ala Tyr His Gin Pro Thr Leu Lys 
405 410 415 

Lys Gin Ala Ala Pro His Leu Lys Ala Glu Val Gly Asn Ser Met Leu 
420 425 430 

Leu Thr Gly His lie Leu lie Leu Leu Gly Gly lie Tyr Leu Leu Val 
435 440 445 

Giy Gin Leu Trp Tyr Phe Trp Arg Arg His Val Phe He Trp lie Ser 
450 455 460 

Phe lie Asp Ser Tyr Phe Glu He Leu Phe Leu Phe Gin Ala Leu Leu 
465 470 475 480 

Thr Val Val Ser Gin Val Leu Cys Phe Leu Ala He Glu Trp Tyr Leu 
485 490 495 

Pro Leu Leu Val Ser Ala Leu Val Leu Gly Trp Leu Asn Leu Leu Tyr 
500 505 510 

Tyr Thr Arg Gly Phe Gin His Thr Gly He Tyr Ser Val Met He Gin 
515 520 525 

Lys Lys Ala He Ser Val Leu Glu Met Glu Asn Gly Tyr Trp Trp Cys 
530 535 540 

Arg Lys Lys Gin Arg Ala Gly Val Met Leu Thr Val Gly Thr Lys Pro 
545 550 555 560 

Asp Gly Ser Pro Asp Glu Arg Trp Cys Phe Arg Val Glu Glu Val Asn 
565 570 575 

Trp Ala Ser Trp Glu Gin Thr Leu Pro Thr Leu Cys Glu Asp Pro Ser 
580 585 590 

Gly Ala Gly Val Pro Arg Thr Leu Glu Asn Pro Val Leu Ala Ser Pro 
595 600 605 

Pro Lys Glu Asp Glu Asp Gly Ala Ser Glu Glu Asn Tyr Val Pro Val 
610 615 620 

Gin Leu Leu Gin Ser Asn 
625 630 
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NOVEL MEMBERS OF THE CAPSAICIN/VANILLOID RECEPTOR FAMILY 

OF PROTEINS AND USES THEREOF 



Background of the Invention 

5 Pain is initiated when the peripheral terminals of a subgroup of sensory neurons 

are activated by noxious chemical, mechanical or thermal stimuli. These neurons, called 
nociceptors, transmit information regarding tissue damage to pain-processing centres in 
the spinal chord and brain (Fields, H.L. Pain, McGraw-Hill, New York, 1987). 
Nociceptors are characterized in part, by their sensitivity to capsaicin, a vanilloid- 

10 containing compound, and a natural product of capsicum peppers that is the active 
ingredient of many "hot" and spicy foods. In mammals, exposure of nociceptor 
terminals to capsaicin leads initially to excitation of the neuron and the consequent 
perception of pain and local release of inflammatory mediators. With prolonged 
exposure, nociceptor terminals become insensitive to capsaicin, as well as to other 

15 noxious stimuli (Szolcsanyi, J. in Capsaicin in the Study of Pain (ed. Wood, J.) 1-26 
(Academic, London, 1993). This latter phenomenon of nociceptor desensitization 
underlies the seemingly paradoxical use of capsaicin as an analgesic agent in the 
treatment of painful disorders ranging from viral and diabetic neuropathies to 
rheumatoid arthritis (Campbell, E. in Capsaicin and the StudyofPain (ed. Wood, J.) 

20 255-272 (Academic, London, 1993); Szallasi, A. et al (1996) Pain 68, 195-208). Some 
of this decreased sensitivity to noxious stimuli may result from reversible changes in the 
nociceptor, but the long-term loss of responsiveness can be explained by death of the 
nociceptor or destruction of its peripheral terminals following exposure to capsaicin 
(Jancso, G. et al (1977) Nature 270, 741-743). 

25 The cellular specificity of capsaicin action and its ability to evoke the sensation 

of burning pain have led to speculation that the target of capsaicin action plays an 
important physiological role in the detection of painful stimuli. Indeed, capsaicin may 
elicit the perception of pain by mimicking the actions of a physiological stimulus or an 
endogenous ligand produced dining tissue injury (James, I.F., Kinkina, N.N. & Wood, 

30 J.N. in Capsaicin in the Study of Pain (ed. Wood, J.N.) 83-104 (Academic, London, 
1993). 
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Caterina M J. et al have recently determined the molecular basis underlying this 
phenomenon by characterizing a functional cDNA that encodes a vanilloid receptor 
(VR-1) in rat sensory ganglia (Caterina M. J. et al, (1997) Nature 389:816-824). VR-1 
is a vanilloid-gated, nonselective cation channel that resembles members of the transient 
5 receptor potential (TRP) channel family, first identified as components of the 

Drosophila phototransduction pathway (Montell et al (1989) Neuron 2:1313-1323). 
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Summary of the Invention 

The present invention is based, at least in part, on the discovery of novel 
members of the Capsaicin/Vanilloid family of receptors. Described herein is the 

10 isolation of the human orthologue of rat VR-1 (rVR-1), referred to herein as hVR-1 , as 
well as another previously unknown member of the VR family of receptors, referred 
herein as VR-2, and specifically as human VR-2 (hVR-2. including an alternate form 
which contains a deletion) and rat VR-2 (rVR-2) nucleic acid and protein molecules. 
The hVR-1, hVR-2, and rVR-2 molecules of the present invention are useful as targets 

15 for developing modulating agents to regulate a variety of cellular processes, e.g., cellular 
processes involved in pain. Accordingly, in one aspect, this invention provides isolated 
nucleic acid molecules encoding hVR-1, hVR-2, and rVR-2 proteins and fragments 
thereof, as well as nucleic acid fragments suitable as primers or hybridization probes for 
the detection of hVR-1, hVR-2, and rVR-2-encoding nucleic acids. 

20 In one embodiment, an hVR-1, hVR-2, or rVR-2 nucleic acid molecule of the 

invention is at least 60%, 65%, 70%, 75%, 80%, 83%, 85%, 86%, 87%, 88%, 89%, 
90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more identical to the 
nucleotide sequence {e.g., to the entire length of the nucleotide sequence) shown in SEQ 
ID NO:l, 3, 4, 6, 7, 9, 10, or 12 or a complement thereof. 

25 In another embodiment, the isolated nucleic acid molecule includes the 

nucleotide sequence shown SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12, or a complement 
thereof. In another embodiment, the nucleic acid molecule includes at least 10, 15, 20, 
or more contiguous nucleotides of SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12. 
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In another embodiment, an hVR-1, hVR-2, and rVR-2 nucleic acid molecule 
includes a nucleotide sequence encoding a protein having an amino acid sequence 
sufficiently homologous to the amino acid sequence of SEQ ID NO:2, 5, 8, or 1 1 . In 
one embodiment, an hVR-1, hVR-2, and rVR-2 nucleic acid molecule includes a 
5 nucleotide sequence encoding a protein having an amino acid sequence at least 60%, 
65%, 70%, 75%, 80%, 85%, 87%, 90%, 95%, 98% or more identical to the entire length 
of the amino acid sequence of SEQ ID NO:2, 5, 8, or 1 1 . 

Another embodiment of the invention features nucleic acid molecules, preferably 
hVR-1, hVR-2, and rVR-2 nucleic acid molecules, which specifically detect hVR-1, 

10 hVR-2, and rVR-2 nucleic acid molecules relative to nucleic acid molecules encoding 
non-hVR-1, non-hVR-2, and non-hVR-2 proteins. For example, in one embodiment, 
such a nucleic acid molecule is at least 100-150, 1 150-200, 200-250, 250-300, 300-350, 
350-400, 400-450, 450-500, 500-550, 550-600, 600-700, 700-800, 800-900, 900-1000, 
1088, or more nucleotides in length and hybridizes under stringent conditions to a 

15 nucleic acid molecule comprising the nucleotide sequence shown in SEQ ID NO:l, 3, 4, 
6, 7, 9 7 10, or 12. In preferred embodiments, the nucleic acid molecules are at least 15 
(e.g., contiguous) nucleotides in length and hybridize under stringent conditions to 
nucleotides 1-17, 3696-3863, or 3901-3909 of SEQ ID NO:l. In other preferred 
embodiments, the nucleic acid molecules comprise nucleotides 1-17, 3696-3863, or 

20 3901-3909 of SEQ ID NO:l. In yet other preferred embodiments, the nucleic acid 

molecules consist of nucleotides 1-17, 3696-3863, or 3901-3909 of SEQ ID NO:l. In 
preferred embodiments, the nucleic acid molecules are at least 15 (e.g., contiguous) 
nucleotides in length and hybridize under stringent conditions to nucleotides 1 944-2003 
of SEQ ID NO:4. In other preferred embodiments, the nucleic acid molecules comprise 

25 nucleotides 1 944-2003 of SEQ ID NO:4. In yet other preferred embodiments, the 
nucleic acid molecules consist of nucleotides 1944-2003 of SEQ ID NO:4. 

In other embodiments, the nucleic acid molecule encodes a naturally occurring 
allelic variant of a polypeptide comprising the amino acid sequence of SEQ ID NO:2, 5, 
8, or 1 1 , wherein the nucleic acid molecule hybridizes to a nucleic acid molecule 

30 consisting of SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12 under stringent conditions and is 
encoded by the same locus as hVR-1 , hVR-2 or rVR-2. 
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Another embodiment of the invention provides a nucleic acid molecule that 
encodes a naturally occurring orthologue of a polypeptide comprising the amino acid 
sequence of SEQ ID NO:2, 5, 8, or 1 1, wherein the nucleic acid molecule hybridizes to a 
nucleic acid molecule consisting of SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12 under stringent 
5 conditions. 

Another embodiment of the invention provides an isolated nucleic acid molecule 
which is antisense to an hVR-1, hVR-2, and rVR-2 nucleic acid molecule, e.g., the 
coding strand of an hVR-1, hVR-2, and rVR-2 nucleic acid molecule. 

Since the hVR2 (the alternate form) and rVR2 sequences represent fragments of 

1 0 the entire coding regions of these genes, another embodiment of the invention provides 
the complete gene sequences. A skilled artisan can readily isolate such molecule using 
the sequences disclosed herein. 

Another aspect of the invention provides a vector comprising an hVR-1, an hVR- 
2, or a rVR-2 nucleic acid molecule. In certain embodiments, the vector is a 

1 5 recombinant expression vector. In another embodiment, the invention provides a host 
cell containing a vector of the invention. In yet another embodiment, the invention 
provides a host cell containing a nucleic acid molecule of the invention. The invention 
also provides a method for producing a protein, preferably an hVR-1, hVR-2, and rVR-2 
protein, by culturing in a suitable medium, a host cell, e.g., a mammalian host cell such 

20 as a non-human mammalian cell, of the invention containing a recombinant expression 
vector, such that the protein is produced. 

Another aspect of this invention features isolated or recombinant hVR-1, hVR-2, 
and rVR-2 proteins and polypeptides. In one embodiment, the isolated protein, 
preferably an hVR-1, hVR-2, or rVR-2 protein, includes at least one transmembrane 

25 domain. In another embodiment, the isolated protein, preferably an hVR-1, hVR-2, or 
rVR-2 protein, includes at least one transmembrane domain and at least one proline rich 
domain. In yet another embodiment, the isolated protein, preferably an hVR-1, hVR-2, 
or rVR-2 protein, includes at least one transmembrane domain, at least one proline rich 
domain, and at least one ankyrin repeat domain. In yet another embodiment, the protein, 

30 preferably an hVR-1, hVR-2, or rVR-2 protein, includes at least one transmembrane 

domain, at least one proline rich domain, and at least one ankyrin repeat domain and has 
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an amino acid sequence at least about 60%, 65%, 70%, 75%, 80%, 85%. 87%, 90%, 
95%, 98% or more homologous to the amino acid sequence of SEQ ID NO:2, 5, 8, or 
1 1 . In another embodiment, the protein, preferably an hVR- 1 , hVR-2, or rVR-2 protein, 
includes at least one transmembrane domain, at least one proline rich domain, and at 

5 least one ankyrin repeat domain and plays a role in the development and regulation of 
pain. In yet another embodiment, the protein, preferably an hVR-1, hVR-2, and rVR-2 
protein, includes at least one transmembrane domain, at least one proline rich domain, 
and at least one ankyrin repeat domain and is encoded by a nucleic acid molecule having 
a nucleotide sequence which hybridizes under stringent hybridization conditions to a 

1 0 nucleic acid molecule comprising the nucleotide sequence of SEQ ID NO: 1 , 3, 4, 6, 7, 9, 
10, or 12. 

In another embodiment, the invention features fragments of the protein having 
the amino acid sequence of SEQ ID NO:2, 5, 8, or 1 1, wherein the fragment comprises 
at least 15, 30, 40, 50, 60, 70, 80, 90, or 100 amino acids {e.g., contiguous amino acids). 

1 5 In another embodiment, the invention features an isolated protein, preferably an 

hVR-1, hVR-2, and rVR-2 protein, which is encoded by a nucleic acid molecule 
consisting of a nucleotide sequence at least about 60%, 65%, 70%, 75%, 80%, 83%, 
85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 
more homologous to a nucleotide sequence of SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12, or a 

20 complement thereof. This invention further features an isolated protein, preferably an 
hVR-1, hVR-2, or rVR-2 protein, which is encoded by a nucleic acid molecule 
consisting of a nucleotide sequence which hybridizes under stringent hybridization 
conditions to a nucleic acid molecule consisting of the nucleotide sequence of SEQ ID 
NO:l, 3, 4, 6, 7, 9, 10, or 12, or a complement thereof. 

25 The proteins of the present invention or portions thereof, e.g., biologically active 

portions thereof, can be operatively linked to a non-hVR-1, non-hVR-2, or non-rVR-2 
polypeptide {e.g. , heterologous amino acid sequences) to form fusion proteins. The 
invention further features antibodies, such as monoclonal or polyclonal antibodies, that 
specifically bind proteins of the invention, preferably hVR-1, hVR-2, and rVR-2 
30 proteins. In addition, the hVR- 1 , hVR-2, and rVR-2 proteins or biologically active 
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portions thereof can be incorporated into pharmaceutical compositions, which optionally 
include pharmaceutical ly acceptable carriers. 

In another aspect, the present invention provides a method for detecting the 
presence of an hVR-1, hVR-2, and rVR-2 nucleic acid molecule, protein or polypeptide 
in a biological sample by contacting the biological sample with an agent capable of 
detecting an hVR-1, hVR-2, and rVR-2 nucleic acid molecule, protein or polypeptide 
such that the presence of an hVR-1, hVR-2, and rVR-2 nucleic acid molecule, protein or 
polypeptide is detected in the biological sample. 

In another aspect, the present invention provides a method for detecting the 
presence of hVR-1, hVR-2, and rVR-2 activity in a biological sample by contacting the 
biological sample with an agent capable of detecting an indicator of hVR-1, hVR-2, and 
rVR-2 activity such that the presence of hVR-1, hVR-2, and rVR-2 activity is detected 

in the biological sample. 

In another aspect, the invention provides a method for modulating hVR-1, hVR- 
2. and rVR-2 activity comprising contacting a cell capable of expressing hVR-1, hVR-2, 
and rVR-2 with an agent that modulates hVR-1, hVR-2, and rVR-2 activity such that 
hVR-1, hVR-2, and rVR-2 activity in the cell is modulated. In one embodiment, the 
agent inhibits hVR-1, hVR-2, and rVR-2 activity. In another embodiment, the agent 
stimulates hVR-1, hVR-2, and rVR-2 activity. In one embodiment, the agent is an 
antibody that specifically binds to an hVR-1, hVR-2, and rVR-2 protein. In another 
embodiment, the agent modulates expression of hVR-1, hVR-2, and rVR-2 by 
modulating transcription of an hVR-1, hVR-2, and rVR-2 gene or translation of an hVR- 
1, hVR-2, and rVR-2 mRNA. In yet another embodiment, the agent is a nucleic acid 
molecule having a nucleotide sequence that is antisense to the coding strand of an hVR- 
1, hVR-2, and rVR-2 mRNA or an hVR-1, hVR-2, and rVR-2 gene. 

In one embodiment, the methods of the present invention are used to treat a 
subject having a disorder characterized by aberrant hVR-1 , hVR-2, and rVR-2 protein or 
nucleic acid expression or activity by administering an agent which is an hVR-1 , hVR-2, 
and rVR-2 modulator to the subject. In one embodiment, the hVR-1 , hVR-2, and rVR-2 
modulator is an hVR-1, hVR-2, and rVR-2 protein. In another embodiment the hVR-1, 
hVR-2, and rVR-2 modulator is an hVR-1, hVR-2, and rVR-2 nucleic acid molecule. In 
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yet another embodiment, the hVR-1, hVR-2, and rVR-2 modulator is a peptide, 
peptidomimetic, or other small molecule. In a further embodiment, the disorder 
characterized by aberrant hVR-L hVR-2 ; and rVR-2 protein or nucleic acid expression 
is a pain disorder, e.g. , hyperalgesia. 
5 The present invention also provides a diagnostic assay for identifying the 

presence or absence of a genetic alteration characterized by at least one of (i) aberrant 
modification or mutation of a gene encoding an hVR-1, hVR-2, and rVR-2 protein; (ii) 
mis-regulation of the gene; and (iii) aberrant post-translational modification of an hVR- 
1 , hVR-2, and rVR-2 protein, wherein a wild-type form of the gene encodes a protein 
1 0 with an hVR- 1 , hVR-2, and rVR-2 activity (as described herein). 

In another aspect the invention provides a method for identifying a compound 
that binds to or modulates the activity of an hVR-1, hVR-2, and rVR-2 protein, by 
providing an indicator composition comprising an hVR-1, hVR-2, and rVR-2 protein 
having hVR-l ? hVR-2, and rVR-2 activity, contacting the indicator composition with a 
1 5 test compound, and determining the effect of the test compound on hVR-1 , hVR-2, and 
rVR-2 activity in the indicator composition to identify a compound that modulates the 
activity of an hVR-1, hVR-2, and rVR-2 protein. 

Other features and advantages of the invention will be apparent from the 
following detailed description and claims. 

20 

Brief Description of the Drawings 

Figure I depicts the full length cDNA sequence and predicted amino acid 
sequence of human VR-1 (hVR-1). The nucleotide sequence corresponds to nucleic 
acids 1 to 3909 of SEQ ID NO:l . The amino acid sequence corresponds to amino acids 

25 1 to 839 of SEQ ID NO:2. The coding region without the 5' and 3' untranslated regions 
of the human VR-1 (hVR-1) gene is shown in SEQ ID NO:3. 

Figure 2 depicts the full length cDNA sequence and predicted amino acid 
sequence of human VR-2 (hVR-2). The nucleotide sequence corresponds to nucleic 
acids 1 to 2809 of SEQ ID NO:4. The amino acid sequence corresponds to amino acids 

30 1 to 764 of SEQ ID NO:5. The coding region without the 5' and 3 f untranslated regions 
of the human VR-2 (hVR-2) gene is shown in SEQ ID NO:6. 
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Figure 3 depicts the partial cDN A sequence and partial predicted amino acid 
sequence of an alternate form of human VR-2 (hVR-2). The nucleotide sequence 
corresponds to nucleic acids 1 to 1489 of SEQ ID NO:7. The amino acid sequence 
corresponds to amino acids 1 to 436 of SEQ ID NO: 8. The coding region without the 5 1 
5 and 3' untranslated regions of the alternate form of human VR-2 (hVR-2) gene is shown 
in SEQ ID NO:9. 

Figure 4 depicts the partial cDNA sequence and partial predicted amino acid 
sequence of rat VR-2 (rVR-2). The nucleotide sequence corresponds to nucleic acids 1 
to 1 794 of SEQ ID NO: 1 0. The amino acid sequence corresponds to amino acids 1 to 

10 554 of SEQ ID NO: 1 1 . The coding region without the 5' and 3' untranslated regions of 
the rat VR-2 (rVR-2) gene is shown in SEQ ID NO: 12. 

Figure 5 depicts an alignment of the hVR-1 protein (SEQ ID NO:2) with the 
human VR-2 protein (SEQ ID NO:5) using the GAP program in the GCG software 
package (Blosum 62 matrix) and a gap weight of 12 and a length weight of 4. 

1 5 Figure 6 depicts an alignment of the hVR- 1 nucleotide sequence (SEQ ID NO: 1 ) 

with the human VR-2 nucleotide sequence (SEQ ID NO:4) using the GAP program in 
the GCG software package (nwsgapdna matrix) and a gap weight of 50 and a length 
weight of 3. 

Figure 7 depicts an alignment of the hVR-2 protein (SEQ ID NO:5) with the rat 
20 VR-2 protein (SEQ ID NO: 1 1 ) using the CLUSTAL W (1 .74) multiple sequence 
alignment program. 

Figure 8 depicts an alignment of the hVR-2 protein (SEQ ID NO:5) with the rat 
VR-2 protein (SEQ ID NO:l 1) using the GAP program in the GCG software package 
(Blosum 62 matrix) and a gap weight of 12 and a length weight of 4. 
25 Figure 9 depicts an alignment of the hVR- 1 nucleotide sequence (SEQ ID NO: 1 ) 

with the rat VR-1 nucleotide sequence (Accession Number: AF0293 10) using the GAP 
program in the GCG software package (nwsgapdna matrix) and a gap weight of 50 and a 
length weight of 3. 

Figure 10 depicts an alignment of the hVR-1 protein (SEQ ID NO:2) with the rat 
30 VR- 1 protein (Accession Number: AF0293 1 0) using the GAP program in the GCG 

software package (Blosum 62 matrix) and a gap weight of 12 and a length weight of 4. 
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Figure I 1 depicts an alignment of the hVR-2 protein (SEQ ID NO:5) with the 
human VR-2 protein (alternate form) (SEQ ID NO:8) using the CLUSTAL W (1.74) 
multiple sequence alignment program. 

Figure 12 depicts a structural, hydrophobicity, and antigenicity analysis of the 

5 hVR-1 protein. 

Figure 13 depicts the results of a search using the amino acid sequence of the 
hVR-1 protein against the HMM database. 

Figure 14 depicts a structural, hydrophobicity, and antigenicity analysis of the 

hVR-2 protein. 

1 0 Figure 15 depicts the results of a search using the amino acid sequence of the 

hVR-2 protein against the HMM database. 

Figure 16 depicts the predicted full length amino acid sequence of the human 
VR-2 protein (alternate form) (SEQ ID NO:20). 

Figure 1 7 depicts an alignment of the hVR-2 protein (SEQ ID NO:5) with the 
1 5 predicted full length human VR-2 protein (alternate form) (SEQ ID NO:20) using the 
CLUSTAL W (1 .74) multiple sequence alignment program. 

Detailed Description of the Invention 

The present invention is based, at least in part, on the discovery of nucleic acid 
20 and amino acid molecules which are novel members of the Capsaicin/Vanilloid family 
of receptors. Described herein is the isolation of the human orthologue of rat VR-1 
(rVR-1), referred to herein as hVR-1, as well as another previously unknown member of 
the VR family of receptors, referred herein as VR-2, and specifically as human VR-2 
(hVR-2) and rat VR-2 (rVR-2) nucleic acid and protein molecules. The hVR-1, hVR-2, 
25 and rVR-2 molecules were identified based on their sequence similarity to the known rat 
vanilloid receptor (VR-1). VR-1 is a vanilloid gated, non-selective cation channel 
which resembles members of the transient receptor potential (TRP) ion channel family 
(described in Montell et al (1989) Neuron 2:1313-1323) that mediate the influx of 
extracellular calcium in response to depletion of intracellular calcium stores. The rat 
30 VR-1 cDNA contains an open reading frame of 25 14 nucleotides that encodes a protein 
of 838 amino acids. Hydrophilicity analysis has indicated that rat VR-1 contains six 
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transmembrane domains (predicted to be mostly a-helices) with an additional short 
hydrophobic stretch between transmembrane regions 5 and 6. The amino terminal 
hydrophilic segment contains a relatively proline rich region followed by three ankyrin 
repeat domains. The rat VR-1 is expressed in small diameter neurons within sensory 

5 ganglia. The present hVR-1 sequence is the human orthologue of rVR-1. As described 
in further detail infra, the human VR-1 is expressed in nodose, trigeminal sensory 
neurons, as well as in some, but not all, small dorsal root ganglion (DRG) neurons and 
in a few medium sized DRG neurons. 

The hVR-1, hVR-2, and rVR-2 molecules of the present invention play a role in 

1 0 pain signaling mechanisms. As used herein, the term "pain signaling mechanisms" 
includes the cellular mechanisms involved in the development and regulation of pain, 
e.g., pain elicited by noxious chemical, mechanical, or thermal stimuli, in a subject, e.g., 
a mammal such as a human. In mammals, the initial detection of noxious chemical, 
mechanical, or thermal stimuli, a process referred to as "nociception", occurs 

1 5 predominantly at the peripheral terminals of specialized, small diameter primary afferent 
neurons, called polymodal nociceptors. These afferent neurons transmit the information 
to the central nervous system, evoking a perception of pain or discomfort and initiating 
appropriate protective reflexes. Capsaicin/Vanilloid receptors, e.g., the hVR-1, hVR-2, 
and rVR-2 molecules of the present invention, present on these afferent neurons, are 

20 involved in detecting these noxious chemical, mechanical, or thermal stimuli and 

transducing this information into membrane depolarization events. Thus, the hVR-1, 
hVR-2, and rVR-2 molecules by participating in pain signaling mechanisms, can 
modulate pain elicitation and provide novel diagnostic targets and therapeutic agents to 
control pain. 

25 The hVR- 1 , hVR-2, and rVR-2 molecules provide novel diagnostic targets and 

therapeutic agents to control pain in a variety of disorders, diseases, or conditions which 
are characterized by a deregulated, e.g., upregulated or downregulated, pain response. 
For example, the hVR-1, hVR-2, and rVR-2 molecules provide novel diagnostic targets 
and therapeutic agents to control the exaggerated pain response elicited during various 

30 forms of tissue injury, e.g. , inflammation, infection, and ischemia, usually referred to as 
hyperalgesia (described in, for example, Fields, H.L. (1987) Pain, New York:McGraw- 
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Hill). Moreover, the hVR-1, hVR-2 ? and rVR-2 molecules provide novel diagnostic 
targets and therapeutic agents to control pain associated with muscoloskeletal disorders, 
e.g., joint pain; tooth pain; headaches; pain associated with surgery, or neuropathic pain. 
As the hVR-1 gene maps to a region of human chromosome 17 between WI- 
5 5436 (7.7cR) and WI-6584 (1 8.9cR) (Example 6), which has been associated with 

myasthenia gravis, Smith-Magenis syndrome, CORDS, Cone-rod dysrtophy, and breast 
cancer, the hVR-1 molecule may provide novel diagnostic targets and therapeutic agents 
to treat, diagnose, or prognose these disorders or other disorders linked to this 
chromosomal region. Similarly, as the hVR-2 gene maps to a region of human 

10 chromosome 17 between AFMA043ZB5 (23.3 cR) and D17S721 (29.3cR) (Example 6) 
which has been associated with myasthenia gravis, Smith-Magenis syndrome, CORD5, 
Cone-rod dysrtophy, choroidal dystrophy, central areolar, and retinal cone dystrophy, 
the hVR-2 molecule may provide novel diagnostic targets and therapeutic agents to treat, 
diagnose, or prognose these disorders or other disorders linked to this chromosomal 

1 5 region. 

The term "family" when referring to the protein and nucleic acid molecules of 
the invention is intended to mean two or more proteins or nucleic acid molecules having 
a common structural domain or motif and having sufficient amino acid or nucleotide 
sequence homology as defined herein. Such family members can be naturally or non- 
20 naturally occurring and can be from either the same or different species. For example, a 
family can contain a first protein of human origin, as well as other, distinct proteins of 
human origin or alternatively, can contain homologues of non-human origin. Members 
of a family may also have common functional characteristics. 

For example, the family of hVR-1, hVR-2, and rVR-2 proteins comprise at least 
25 one, and preferably six "transmembrane domains." As used herein, the term 

"transmembrane domain" includes an amino acid sequence of about 1 5 amino acid 
residues in length which spans the plasma membrane. More preferably, a 
transmembrane domain includes about at least 20, 25, 30, 35, 40, or 45 amino acid 
residues and spans the plasma membrane. Transmembrane domains are rich in 
30 hydrophobic residues, and typically have a helical structure. In a embodiment, at least 
50%, 60%, 70%, 80%, 90%, 95% or more of the amino acid residues of a 
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transmembrane domain are hydrophobic, e.g., leucines, isoleucines, tyrosines, or 
tryptophans. Transmembrane domains are described in, for example. Zagotta W.N. et 
al, (1996) Annuo! Rev. Neurosci. 19: 235-63, the contents of which are incorporated 
herein by reference. Amino acid residues 434-455, 480-495, (509-53 1 ; based on 
5 homology to the rat VR-1) or 514-531, (543-569; based on homology to the rat VR-1) or 
538-555, (577-596; based on homology to the rat VR-1) or 580-599, and (656-683; 
based on homology to the rat VR-1) or 658-682 of hVR-1 (SEQ ID NO:2) and amino 
acid residues 391-410, 431-448, 459-476, 486-508, 538-556, and 621-645 of hVR-2 
(SEQ ID NO: 5) comprise transmembrane domains. 

10 In another embodiment, an hVR-1, hVR-2, and rVR-2 of the present invention is 

identified based on the presence of a "proline rich domain" in the protein or 
corresponding nucleic acid molecule. As used herein, the term "proline rich domain" 
includes an amino acid sequence of about 4-6 amino acid residues in length having the 
general sequence X-Pro-X-X-Pro-X (where X can be any amino acid). Proline rich 

15 domains are usually located in a helical structure and bind through hydrophobic 
interactions to SH3 domains. SH3 domains recognize proline rich domains in both 
forward and reverse orientations. Proline rich domains are described in, for example, 
Sattler M. et al. (1998) Leukemia 12:637-644, the contents of which are incorporated 
herein by reference. 

20 In another embodiment, an hVR-1, hVR-2, and rVR-2 of the present invention is 

identified based on the presence of an "ankyrin repeat domain" in the protein or 
corresponding nucleic acid molecule. As used herein, the term "ankyrin repeat domain" 
includes a protein domain having an amino acid sequence of about 30-50 amino acid 
residues and having a bit score for the alignment of the sequence to the ankyrin repeat 

25 domain (HMM) of at least 6. Preferably, an ankyrin repeat domain includes at least 
about 30-45, more preferably about 30-40 amino acid residues, or about 30-35 amino 
acids and has a bit score for the alignment of the sequence to the ankyrin repeat domain 
(HMM) of at least 3-10, more preferably 10-30, more preferably 30-50, even more 
preferably 50-75, 75-100, 100-200 or greater. The ankyrin repeat domain HMM has 

30 been assigned the PFAM Accession PF00023 (http://genome.wustl.edu/Pfam/.html). 
Ankyrin repeats are involved in protein-protein interactions and are described in, for 
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example, Ketchum K.A et al (1996) FEES Letters 378: 1 9-26, the contents of which are 
incorporated herein by reference. 

To identify the presence of an ankyrin repeat domain in an hVR-1 , hVR-2, and 
rVR-2 protein and make the determination that a protein of interest has a particular 
5 profile, the amino acid sequence of the protein is searched against a database of HMMs 
{e.g., the Pfam database, release 2.1) using the default parameters 
(http://www.sanger.ac.uk/Software/Pfam/HMM_search). A description of the Pfam 
database can be found in Sonhammer et al (1997) Proteins 28(3)405-420 and a detailed 
description of HMMs can be found, for example, in Gribskov et al (1990) Meth 

10 Enzymol 183:146-159; Gribskov et al (1987) Proc. Natl Acad. Scl USA 84:4355- 
4358; Krogh et al ( 1 994) J. Mol Biol 235:1501-153 1; and Stultz et al{\993) Protein 
Sci. 2:305-314. the contents of which are incorporated herein by reference. A search 
was performed against the HMM database resulting in the identification of three ankyrin 
repeat domains in the amino acid sequence of SEQ ID NO:2 (at about residues 201-233, 

15 248-283, and 333-361) and SEQ ID NO:5 (at about residues 162-194, 208-243, and 293- 
328). The results of the searches are set forth in Figures 13 and 15. 

Isolated proteins of the present invention, preferably hVR-1, hVR-2, and rVR-2 
proteins, have an amino acid sequence sufficiently identical to the amino acid sequence of 
SEQ ID NO:2, 5, 8, or 1 1 or are encoded by a nucleotide sequence sufficiently identical to 

20 SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12. As used herein, the term "sufficiently identical" 

refers to a first amino acid or nucleotide sequence which contains a sufficient or minimum 
number of identical or equivalent (e.g., an amino acid residue which has a similar side 
chain) amino acid residues or nucleotides to a second amino acid or nucleotide sequence 
such that the first and second amino acid or nucleotide sequences share common structural 

25 domains or motifs and/or a common functional activity. For example, amino acid or 

nucleotide sequences which share common structural domains have at least 30%, 40%, or 
50% identity, preferably 60% identity, more preferably 70%-80%, and even more 
preferably 90-95% identity across the amino acid sequences of the domains and contain at 
least one and preferably two structural domains or motifs, are defined herein as sufficiently 

30 identical. Furthermore, amino acid or nucleotide sequences which share at least 30%, 40%, 
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or 50%, preferably 60%, more preferably 70-80%, or 90-95% identity and share a common 
functional activity are defined herein as sufficiently identical. 

As used interchangeably herein, an "hVR-l, hVR-2, and rVR-2 activity", "biological 
activity of hVR-1, hVR-2, and rVR-2" or "functional activity of hVR-1, hVR-2, and rVR-2", 
5 refers to an activity exerted by an hVR-l ? hVR-2, and rVR-2 protein, polypeptide or nucleic 
acid molecule on an hVR-1, hVR-2, and rVR-2 responsive cell or on an hVR-1, hVR-2, and 
rVR-2 protein substrate, as determined in vivo, or in vitro, according to standard techniques. 
In one embodiment, an hVR-1, hVR-2, and rVR-2 activity is a direct activity, such as an 
association with an hVR-1, hVR-2, and rVR-2-target molecule. As used herein, a "target 

10 molecule" or "binding partner" is a molecule with which an hVR-1, hVR-2, and rVR-2 

protein binds or interacts in nature, such that hVR-1, hVR-2, and rVR-2-mediated function is 
achieved. An hVR-U hVR-2, and rVR-2 target molecule can be a non-hVR-U non-hVR-2, 
and non-rVR-2 molecule or an hVR-1 , hVR-2, and rVR-2 protein or polypeptide of the 
present invention. In an exemplary embodiment, an hVR-1, hVR-2, and rVR-2 target 

15 molecule is an hVR-1, hVR-2, and rVR-2 ligand, e.g., capsaicin. Alternatively, an hVR-1, 
hVR-2, and rVR-2 activity is an indirect activity, such as a cellular signaling activity 
mediated by interaction of the hVR-1, hVR-2, and rVR-2 protein with an hVR-K hVR-2, and 
rVR-2 ligand. 

Accordingly, another embodiment of the invention features isolated hVR-1, 
20 hVR-2, and rVR-2 proteins and polypeptides having an hVR-1 , hVR-2, and rVR-2 

activity. Other proteins of the invention are hVR-1, hVR-2, and rVR-2 proteins having 
at least one, and preferably six, transmembrane domains and, preferably, an hVR-1, 
hVR-2, and rVR-2 activity. Yet other proteins of the invention are hVR-1 , hVR-2, and 
rVR-2 proteins having at least one transmembrane domain, at least one proline rich 
25 domain and, preferably, an hVR-1 , hVR-2, and rVR-2 activity. Other proteins of the 
invention are hVR-1, hVR-2, and rVR-2 proteins having at least one transmembrane 
domain, at least one proline rich domain, at least one ankyrin repeat domain and, 
preferably, an hVR-1, hVR-2, and rVR-2 activity. Additional proteins of the invention 
have at least one transmembrane domain, at least one proline rich domain, at least one 
30 ankyrin repeat domain, and are, preferably, encoded by a nucleic acid molecule having a 
nucleotide sequence which hybridizes under stringent hybridization conditions to a 



BNSDOCID: <WO 0029577A1_I_> 



WO 00/29577 PCT/US99/2670I 

- 16- 

nucleic acid molecule comprising the nucleotide sequence of SEQ ID NO:l, 3, 4, 6, 7, 9, 
10, or 12. 

The nucleotide sequence of the full length hVR-1 cDNA and the predicted 
amino acid sequence of the hVR-1 polypeptide are shown in Figure 1 and in SEQ ID 
5 NOs: 1 and 2, respectively. 

The nucleotide sequence of the full length hVR-2 cDNA and the predicted amino 
acid sequence of the hVR-2 polypeptide are shown in Figure 2 and in SEQ ID NOs:4 
and 5, respectively. 

The nucleotide sequence of the partial hVR-2 (alternate form) cDNA and the 
10 predicted amino acid sequence of the hVR-2 (alternate form) polypeptide are shown in 
Figure 3 and in SEQ ID NOs: 7 and 8, respectively. 

The nucleotide sequence of the partial rVR-2 cDNA and the predicted amino 
acid sequence of the rVR-2 polypeptide are shown in Figure 4 and in SEQ ID NOs: 10 
and 1 1, respectively. 

15 Various aspects of the invention are described in further detail in the following 

subsections: 

I. Isolated Nucleic Acid Molecules 

One aspect of the invention pertains to isolated nucleic acid molecules that 

20 encode hVR-1 , hVR-2, and rVR-2 proteins or biologically active portions thereof, as 

well as nucleic acid fragments sufficient for use as hybridization probes to identify hVR- 
1, hVR-2, and rVR-2-encoding nucleic acid molecules (e.g., hVR-1, hVR-2, and rVR-2 
mRNA) and fragments for use as PCR primers for the amplification or mutation of hVR- 
1, hVR-2, and rVR-2 nucleic acid molecules. As used herein, the term "nucleic acid 

25 molecule" is intended to include DNA molecules (e.g., cDNA or genomic DNA) and 
RNA molecules (e.g., mRNA) and analogs of the DNA or RNA generated using 
nucleotide analogs. The nucleic acid molecule can be single-stranded or double- 
stranded, but preferably is double-stranded DNA. 

The term "isolated nucleic acid molecule" includes nucleic acid molecules which 

30 are separated from other nucleic acid molecules which are present in the natural source 
of the nucleic acid. For example, with regards to genomic DNA, the term "isolated" 
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includes nucleic acid molecules which are separated from the chromosome with which 
the genomic DNA is naturally associated. Preferably, an "isolated" nucleic acid is free 
of sequences which naturally flank the nucleic acid (i.e., sequences located at the 5' and 
3' ends of the nucleic acid) in the genomic DNA of the organism from which the nucleic 
5 acid is derived. For example, in various embodiments, the isolated hVR-1, hVR-2, and 
rVR-2 nucleic acid molecule can contain less than about 5 kb, 4kb, 3kb, 2kb, 1 kb, 0.5 
kb or 0.1 kb of nucleotide sequences which naturally flank the nucleic acid molecule in 
genomic DNA of the cell from which the nucleic acid is derived. Moreover, an 
"isolated" nucleic acid molecule, such as a cDNA molecule, can be substantially free of 
10 other cellular material, or culture medium when produced by recombinant techniques, or 
substantially free of chemical precursors or other chemicals when chemically 
synthesized. 

A nucleic acid molecule of the present invention, e.g., a nucleic acid molecule 
having the nucleotide sequence of SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12. Using all or 

15 portion of the nucleic acid sequence of SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12, as a 

hybridization probe. hVR-1, hVR-2, and rVR-2 nucleic acid molecules can be isolated 
using standard hybridization and cloning techniques (e.g., as described in Sambrook, J., 
Fritsh, E. F., and Maniatis, T. Molecular Cloning: A Laboratory Manual. 2nd, ed. y Cold 
Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 

20 NY, 1989). 

Moreover, a nucleic acid molecule encompassing all or a portion of SEQ ID 
NO:l, 3, 4, 6, 7, 9, 10, or 12, can be isolated by the polymerase chain reaction (PCR) 
using synthetic oligonucleotide primers designed based upon the sequence of SEQ ID 
NO:l,3,4, 6, 7, 9, 10, or 12. 

25 A nucleic acid of the invention can be amplified using cDNA, mRNA or 

alternatively, genomic DNA, as a template and appropriate oligonucleotide primers 
according to standard PCR amplification techniques. The nucleic acid so amplified can 
be cloned into an appropriate vector and characterized by DNA sequence analysis. 
Furthermore, oligonucleotides corresponding to hVR-1, hVR-2, and rVR-2 nucleotide 

30 sequences can be prepared by standard synthetic techniques, e.g., using an automated 
DNA synthesizer. 
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In one embodiment, an isolated nucleic acid molecule of the invention comprises 
the nucleotide sequence shown in SEQ ID NO: 1 . The sequence of SEQ ID NO: 1 
corresponds to the full length hVR-1 encoding cDNA. 

In another embodiment, an isolated nucleic acid molecule of the invention 
5 comprises the nucleotide sequence shown in SEQ ID NO:4. The sequence of SEQ ID 
NO:4 corresponds to the full length hVR-2 encoding cDNA. 

In another embodiment, an isolated nucleic acid molecule of the invention 
comprises the nucleotide sequence shown in SEQ ID NO:7. The sequence of SEQ ID 
NO:7 corresponds to a fragment of the hVR-2 (alternate form) encoding cDNA. 
10 In another embodiment, an isolated nucleic acid molecule of the invention 

comprises the nucleotide sequence shown in SEQ ID NO:10. The sequence of SEQ ID 
NO: 10 corresponds to a fragment of the rVR-2 cDNA. 

In another embodiment, an isolated nucleic acid molecule of the invention 
comprises a nucleic acid molecule which is a complement of the nucleotide sequence 
15 shown in SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12, or a portion of any of these nucleotide 
sequences. A nucleic acid molecule which is complementary to the nucleotide sequence 
shown in SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12, is one which is sufficiently 
complementary to the nucleotide sequence shown in SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 
12, such that it can hybridize to the nucleotide sequence shown in SEQ ID NO:l, 3, 4, 6, 
20 7, 9, 10, or 12 thereby forming a stable duplex. 

In still another embodiment, an isolated nucleic acid molecule of the present 
invention comprises a nucleotide sequence which is at least about 60%, 65%, 70%, 75%, 
80%, 83%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 
98%, 99% or more homologous to the entire length of the nucleotide sequence shown in 
25 SEQIDNO:l,3,4,6, 7, 9, 10, or 12, or a portion of any of these nucleotide sequences. 

Moreover, the nucleic acid molecule of the invention can comprise only a portion 
of the nucleic acid sequence of SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12, for example, a 
fragment which can be used as a probe or primer or a fragment encoding a portion of an 
hVR-1, hVR-2, and rVR-2 protein, e.g., a biologically active portion of an hVR-1, hVR- 
30 2, and rVR-2 protein. The nucleotide sequence determined from the cloning of the 
hVR-1, hVR-2, and rVR-2 gene allows for the generation of probes and primers 
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designed for use in identifying and/or cloning other hVR-1, hVR-2. and rVR-2 family 
members, as well as hVR-1, hVR-2, and rVR-2 homologues from other species. The 
probe/primer typically comprises a substantially purified oligonucleotide. The 
oligonucleotide typically comprises a region of nucleotide sequence that hybridizes 
5 under stringent conditions to at least about 12 or 15, preferably about 20 or 25, more 
preferably about 30, 35, 40, 45, 50, 55, 60, 65, 75, or 100 consecutive nucleotides of a 
sense sequence of SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12, of an anti-sense sequence of 
SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12, or of a naturally occurring allelic variant or mutant 
of SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12. In an exemplary embodiment, a nucleic acid 

1 0 molecule of the present invention comprises a nucleotide sequence which is greater than 
100-150. 150-200. 200-250, 250-300, 300-350, 350-400, 400-450, 450-500, 500-550, 
550-600. 600-650, 650-700, 700-750, 750-800, 800-850, 850-900, 900-950, 950-1000, 
1088, or more nucleotides in length and hybridizes under stringent hybridization 
conditions to a nucleic acid molecule of SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12. 

1 5 Probes based on the hVR-1, hVR-2, and rVR-2 nucleotide sequences can be used 

to detect transcripts or genomic sequences encoding the same or homologous proteins. 
In preferred embodiments, the probe further comprises a label group attached thereto, 
e.g., the label group can be a radioisotope, a fluorescent compound, an enzyme, or an 
enzyme co-factor. Such probes can be used as a part of a diagnostic test kit for 

20 identifying cells or tissue which misexpress an hVR-1, hVR-2, and rVR-2 protein, such 
as by measuring a level of an hVR-1, hVR-2, and rVR-2-encoding nucleic acid in a 
sample of cells from a subject e.g., detecting hVR-1, hVR-2, and rVR-2 mRNA levels or 
determining whether a genomic hVR-1, hVR-2, and rVR-2 gene has been mutated or 
deleted. 

25 A nucleic acid fragment encoding a "biologically active portion of an hVR-1, 

hVR-2, and rVR-2 protein" can be prepared by isolating a portion of the nucleotide 
sequence of SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12, which encodes a polypeptide having 
an hVR-1, hVR-2, and rVR-2 biological activity (the biological activities of the hVR-1, 
hVR-2, and rVR-2 proteins are described herein), expressing the encoded portion of the 

30 hVR-1, hVR-2, and rVR-2 protein (e.g., by recombinant expression in vitro) and 

assessing the activity of the encoded portion of the hVR-1, hVR-2, and rVR-2 protein. 
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The invention further encompasses nucleic acid molecules that differ from the 
nucleotide sequence shown in SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12, due to degeneracy 
of the genetic code and thus encode the same hVR-1, hVR-2, and rVR-2 proteins as 
those encoded by the nucleotide sequence shown in SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 
5 12. In another embodiment, an isolated nucleic acid molecule of the invention has a 
nucleotide sequence encoding a protein having an amino acid sequence shown in SEQ 
IDNO:2, 5, 8, or 11. 

In addition to the hVR-1, hVR-2, and rVR-2 nucleotide sequences shown in SEQ 
ID NO:l, 3, 4, 6, 7, 9, 10, or 12, it will be appreciated by those skilled in the art that 

10 DNA sequence polymorphisms that lead to changes in the amino acid sequences of the 
hVR-1, hVR-2, and rVR-2 proteins may exist within a population (e.g., the human 
population). Such genetic polymorphism in the hVR-1, hVR-2, and rVR-2 genes may 
exist among individuals within a population due to natural allelic variation. As used 
herein, the terms "gene" and "recombinant gene" refer to nucleic acid molecules which 

15 include an open reading frame encoding an hVR-1, hVR-2, and rVR-2 protein, 

preferably a mammalian hVR-l ? hVR-2, and rVR-2 protein, and can further include non- 
coding regulatory sequences, and introns. 

Allelic variants of hVR-1, hVR-2, and rVR-2 include both functional and non- 
functional hVR-1, hVR-2, and rVR-2 proteins. Functional allelic variants are naturally 

20 occurring amino acid sequence variants of the hVR-1, hVR-2, and rVR-2 protein that 
maintain the ability to bind an hVR-1, hVR-2 ? and rVR-2 ligand and/or modulate a pain 
signaling mechanism. Functional allelic variants will typically contain only 
conservative substitution of one or more amino acids of SEQ ID NO:2, 5, 8, or 1 1, or 
substitution, deletion or insertion of non-critical residues in non-critical regions of the 

25 protein. 

Non-functional allelic variants are naturally occurring amino acid sequence 
variants of the hVR-1, hVR-2, and rVR-2 protein that do not have the ability to either 
bind an hVR-1, hVR-2, and rVR-2 ligand and/or modulate a pain signaling mechanism. 
Non-functional allelic variants will typically contain a non-conservative substitution, a 
30 deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID 
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NO:2, 5, 8, or 1 1, or a substitution, insertion or deletion in critical residues or critical 
regions. 

The present invention further provides non-human orthologues of the hVR-2 and 
rVR-2 protein. Orthologues of the hVR-2 and rVR-2 protein are proteins that are 
5 isolated from non-human and non-rat organisms and possess the same hVR-2 and rVR- 
2 ligand binding and/or modulation of pain signaling mechanism capabilities of the 
hVR-2 and rVR-2 proteins. Orthologues of the hVR-2 and rVR-2 proteins can readily 
be identified as comprising an amino acid sequence that is substantially homologous to 
SEQ ID NO: 4, 6, 8 or 10. 

10 Moreover, nucleic acid molecules encoding other hVR-1 , hVR-2, and rVR-2 

family members and, thus, which have a nucleotide sequence which differs from the 
hVR-1, hVR-2, and rVR-2 sequences of SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12, are 
intended to be within the scope of the invention. For example, another hVR-1, hVR-2, 
and rVR-2 cDNA can be identified based on the nucleotide sequence of hVR-1, hVR-2, 

15 and rVR-2. Moreover, nucleic acid molecules encoding VR-2 proteins from different 
species, and which, thus, have a nucleotide sequence which differs from the hVR-2 and 
rVR-2 sequences of SEQ ID NO:4, 6, 8, or 10 are intended to be within the scope of the 
invention. For example, a mouse hVR-2 cDNA can be identified based on the 
nucleotide sequence of the human VR-2 (hVR-2) or the rat VR-2 (rVR-2). 

20 Nucleic acid molecules corresponding to natural allelic variants and homologues 

of the hVR-1, hVR-2, and rVR-2 cDNAs of the invention can be isolated based on their 
homology to the hVR-1, hVR-2, and rVR-2 nucleic acids disclosed herein using the 
cDNAs disclosed herein, or a portion thereof, as a hybridization probe according to 
standard hybridization techniques under stringent hybridization conditions. Nucleic acid 

25 molecules corresponding to natural allelic variants and homologues of the hVR-l ? hVR- 
2, and rVR-2 cDNAs of the invention can further be isolated by mapping to the same 
chromosome or locus as the hVR-1, hVR-2, and rVR-2 gene. 

Accordingly, in another embodiment, an isolated nucleic acid molecule of the 
invention is at least 1 5, 20, 25, 30 or more nucleotides in length and hybridizes under 

30 stringent conditions to the nucleic acid molecule comprising the nucleotide sequence of 
SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12. In other embodiment, the nucleic acid is at least 
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30, 50. 100, 150, 200. 250, 300, 350, 400, 450, 500, 550, 600, 650, 700, 750, 800, 850, 
900, or 950 nucleotides in length. As used herein, the term "hybridizes under stringent 
conditions' 1 is intended to describe conditions for hybridization and washing under 
which nucleotide sequences at least 60% identical to each other typically remain 
5 hybridized to each other. Preferably, the conditions are such that sequences at least 

about 70%, more preferably at least about 80%, even more preferably at least about 85% 
or 90% identical to each other typically remain hybridized to each other. Such stringent 
conditions are known to those skilled in the art and can be found in Current Protocols in 
Molecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. A preferred, non- 
10 limiting example of stringent hybridization conditions are hybridization in 6X sodium 
chloride/sodium citrate (SSC) at about 45°C, followed by one or more washes in 0.2 X 
SSC. 0.1% SDS at 50°C, preferably at 55°C, more preferably at 60°C and even more 
preferably at 65°C. Preferably, an isolated nucleic acid molecule of the invention that 
hybridizes under stringent conditions to the sequence of SEQ ID NO:l, 3, 4, 6, 7, 9, 10, 

15 or 1 2 corresponds to a naturally-occurring nucleic acid molecule. As used herein, a 

"naturally-occurring" nucleic acid molecule refers to an RNA or DNA molecule having 
a nucleotide sequence that occurs in nature {e.g., encodes a natural protein). 

In addition to naturally-occurring allelic variants of the hVR-1, hVR-2, and rVR-2 
sequences that may exist in the population, the skilled artisan will further appreciate that 

20 changes can be introduced by mutation into the nucleotide sequences of SEQ ID NO: 1 , 3, 
4, 6, 7, 9, 10, or 12, thereby leading to changes in the amino acid sequence of the encoded 
hVR-1, hVR-2, and rVR-2 proteins, without altering the functional ability of the hVR-1, 
hVR-2, and rVR-2 proteins. For example, nucleotide substitutions leading to amino acid 
substitutions at "non-essential" amino acid residues can be made in the sequence of SEQ 

25 ID NO:l, 3, 4, 6, 7, 9, 10, or 12. A "non-essential" amino acid residue is a residue that 
can be altered from the wild-type sequence of hVR-1, hVR-2, and rVR-2 {e.g., the 
sequence of SEQ ID NO:2, 5, 8, or 1 1) without altering the biological activity, whereas 
an "essential" amino acid residue is required for biological activity. For example, amino 
acid residues that are conserved among the hVR-1 , hVR-2, and rVR-2 proteins of the 

30 present invention, are predicted to be particularly unamenable to alteration. Furthermore, 
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additional amino acid residues that are conserved between the hVR-1, hVR-2. and rVR-2 
proteins of the present invention and other members of the Capsaicin/Vanilloid receptor 
family are not likely to be amenable to alteration. 

Accordingly, another aspect of the invention pertains to nucleic acid molecules 
5 encoding hVR-1, hVR-2, and rVR-2 proteins that contain changes in amino acid residues 
that are not essential for activity. Such hVR-1, hVR-2, and rVR-2 proteins differ in 
amino acid sequence from SEQ ID NO:2, 5, 8, or 1 1, yet retain biological activity. In 
one embodiment, the isolated nucleic acid molecule comprises a nucleotide sequence 
encoding a protein, wherein the protein comprises an amino acid sequence at least about 
10 60%, 65%, 70%, 75%, 80%, 85%, 87%, 90%, 95%, 98% or more homologous to .SEQ 
IDNO:2. 5, 8, or 11. 

An isolated nucleic acid molecule encoding an hVR-1, hVR-2, and rVR-2 protein 
homologous to the protein of SEQ ID NO:2, 5, 8, or 1 1 can be created by introducing one 
or more nucleotide substitutions, additions or deletions into the nucleotide sequence of 

15 SEQ ID NO: 1, 3, 4, 6, 7, 9, 10, or 12, such that one or more amino acid substitutions, 
additions or deletions are introduced into the encoded protein. Mutations can be 
introduced into SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12, by standard techniques, such as 
site-directed mutagenesis and PCR-mediated mutagenesis. Preferably, conservative 
amino acid substitutions are made at one or more predicted non-essential amino acid 

20 residues. A "conservative amino acid substitution" is one in which the amino acid 

residue is replaced with an amino acid residue having a similar side chain. Families of 
amino acid residues having similar side chains have been defined in the art. These 
families include amino acids with basic side chains (e.g., lysine, arginine, histidine), 
acidic side chains (e.g., aspartic acid, glutamic acid), uncharged polar side chains (e.g., 

25 glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine), nonpolar side chains 
(e.g., alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan), 
beta-branched side chains (e.g., threonine, valine, isoleucine) and aromatic side chains 
(e.g., tyrosine, phenylalanine, tryptophan, histidine). Thus, a predicted nonessential 
amino acid residue in an hVR-1, hVR-2, and rVR-2 protein is preferably replaced with 

30 another amino acid residue from the same side chain family. Alternatively, in another 
embodiment, mutations can be introduced randomly along all or part of an hVR-1, hVR- 
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2, and rVR-2 coding sequence, such as by saturation mutagenesis, and the resultant 
mutants can be screened for hVR-1, hVR-2, and rVR-2 biological activity to identify 
mutants that retain activity. Following mutagenesis of SEQ ID NO: 1.3,4, 6, 7, 9, 1 0, or 
12. 

5 In a embodiment, a mutant hVR-1 , hVR-2, and rVR-2 protein can be assayed for 

the ability to (1) interact with a non-hVR-1, non-hVR-2, or non- rVR-2 protein molecule, 
e.g., a vanilloid compound such as capsaicin; (2) modulate intracellular calcium 
concentration; (3) activate an hVR-1, hVR-2, and rVR-2-dependent signal transduction 
pathway; or (4) modulate a pain signaling mechanism. 

10 In addition to the nucleic acid molecules encoding hVR-1 , hVR-2, and rVR-2 

proteins described above, another aspect of the invention pertains to isolated nucleic acid 
molecules which are antisense thereto. An "antisense" nucleic acid comprises a 
nucleotide sequence which is complementary to a "sense" nucleic acid encoding a 
protein, e.g., complementary to the coding strand of a double-stranded cDNA molecule or 

1 5 complementary to an mRNA sequence. Accordingly, an antisense nucleic acid can 

hydrogen bond to a sense nucleic acid. The antisense nucleic acid can be complementary 
to an entire hVR-1, hVR-2, and rVR-2 coding strand, or to only a portion thereof. In one 
embodiment, an antisense nucleic acid molecule is antisense to a "coding region" of the 
coding strand of a nucleotide sequence encoding hVR-1, hVR-2, and rVR-2. The term 

20 "coding region" refers to the region of the nucleotide sequence comprising codons which 
are translated into amino acid residues {e.g., the coding region of hVR-1, hVR-2, and 
rVR-2). In another embodiment, the antisense nucleic acid molecule is antisense to a 
"noncoding region" of the coding strand of a nucleotide sequence encoding hVR-1, hVR- 
2, and rVR-2. The term "noncoding region" refers to 5' and 3' sequences which flank the 

25 coding region that are not translated into amino acids {i.e., also referred to as 5' and 3' 
untranslated regions). 

Given the coding strand sequences encoding hVR-1, hVR-2, and rVR-2 disclosed 
herein, antisense nucleic acids of the invention can be designed according to the rules of 
Watson and Crick base pairing. The antisense nucleic acid molecule can be 

30 complementary to the entire coding region of hVR-1, hVR-2, and rVR-2 mRNA, but 
more preferably is an oligonucleotide which is antisense to only a portion of the coding 
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or noncoding region of hVR-1, hVR-2, and rVR-2 mRNA. For example, the antisense 
oligonucleotide can be complementary to the region surrounding the translation start site 
of hVR-1, hVR-2, and rVR-2 mRNA. An antisense oligonucleotide can be, for example, 
about 5, 10, 15, 20, 25, 30, 35, 40, 45 or 50 nucleotides in length. An antisense nucleic 
5 acid of the invention can be constructed using chemical synthesis and enzymatic ligation 
reactions using procedures known in the art. For example, an antisense nucleic acid (e.g., 
an antisense oligonucleotide) can be chemically synthesized using naturally occurring 
nucleotides or variously modified nucleotides designed to increase the biological stability 
of the molecules or to increase the physical stability of the duplex formed between the 

10 antisense and sense nucleic acids, e.g., phosphorothioate derivatives and acridine 

substituted nucleotides can be used. Examples of modified nucleotides which can be 
used to generate the antisense nucleic acid include 5-fluorouracil, 5-bromouracil, 5- 
chlorouracil, 5-iodouracil, hypoxanthine, xantine, 4-acetylcytosine, 5- 
(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl-2-thiouridine, 5- 

15 carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, 
N6-isopentenyladenine, 1 -methy Iguanine, 1-methylinosine, 2,2-dimethylguanine, 2- 
methyladenine, 2-methy Iguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7- 
methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta- 
D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio- 

20 N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, 

queosine, 2-thiocytosine, 5 -methyl -2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, 
uraciI-5- oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 3- 
(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. Alternatively, 
the antisense nucleic acid can be produced biologically using an expression vector into 

25 which a nucleic acid has been subcloned in an antisense orientation (i.e., RNA 

transcribed from the inserted nucleic acid will be of an antisense orientation to a target 
nucleic acid of interest, described further in the following subsection). 

The antisense nucleic acid molecules of the invention are typically administered 
to a subject or generated in situ such that they hybridize with or bind to cellular mRNA 

30 and/or genomic DNA encoding an hVR-1, hVR-2, and rVR-2 protein to thereby inhibit 
expression of the protein, e.g., by inhibiting transcription and/or translation. The 
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hybridization can be by conventional nucleotide complementarity to form a stable 
duplex, or, for example, in the case of an antisense nucleic acid molecule which binds to 
DNA duplexes, through specific interactions in the major groove of the double helix. 
An example of a route of administration of antisense nucleic acid molecules of the 
5 invention include direct injection at a tissue site. Alternatively, antisense nucleic acid 
molecules can be modified to target selected cells and then administered systemically. 
For example, for systemic administration, antisense molecules can be modified such that 
they specifically bind to receptors or antigens expressed on a selected cell surface, e.g., 
by linking the antisense nucleic acid molecules to peptides or antibodies which bind to 
10 cell surface receptors or antigens. The antisense nucleic acid molecules can also be 

delivered to cells using the vectors described herein. To achieve sufficient intracellular 
concentrations of the antisense molecules, vector constructs in which the antisense 
nucleic acid molecule is placed under the control of a strong pol II or pol HI promoter 
are preferred. 

15 In yet another embodiment, the antisense nucleic acid molecule of the invention 

is an -anomeric nucleic acid molecule. An a-anomeric nucleic acid molecule forms 
specific double-stranded hybrids with complementary RNA in which, contrary to the 
usual -units, the strands run parallel to each other (Gaultier et al. (1987) Nucleic Acids. 
Res. 1 5:6625-6641). The antisense nucleic acid molecule can also comprise a 2-o- 

20 methylribonucleotide (Inoue et al (1987) Nucleic Acids Res. 15:613 1-6148) or a 
chimeric RNA-DNA analogue (Inoue et al (1987) FEBS Lett. 215:327-330). 

In still another embodiment, an antisense nucleic acid of the invention is a 
ribozyme. Ribozymes are catalytic RNA molecules with ribonuclease activity which are 
capable of cleaving a single-stranded nucleic acid, such as an mRNA, to which they 

25 have a complementary region. Thus, ribozymes {e.g., hammerhead ribozymes 
(described in Haselhoff and Gerlach (1988) Nature 334:585-591)) can be used to 
catalytically cleave hVR-1, hVR-2, and rVR-2 mRNA transcripts to thereby inhibit 
translation of hVR-1, hVR-2, and rVR-2 mRNA. A ribozyme having specificity for an 
hVR-1, hVR-2, and rVR-2-encoding nucleic acid can be designed based upon the 

30 nucleotide sequence of an hVR- 1 , hVR-2, and rVR-2 cDNA disclosed herein {i.e. , SEQ 
ID NO:l, 3, 4, 6, 7, 9, 10, or 12). For example, a derivative of a Tetrahymena L-19 IVS 
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RNA can be constructed in which the nucleotide sequence of the active site is 
complementary to the nucleotide sequence to be cleaved in an hVR-1, hVR-2, and rVR- 
2-encoding mRNA. See, e.g., Cech et al U.S. Patent No. 4,987,071; and Cech et al 
U.S. Patent No. 5,1 16,742. Alternatively, hVR-1, hVR-2, and rVR-2 mRNA can be 
5 used to select a catalytic RNA having a specific ribonuclease activity from a pool of 
RNA molecules. See, for example, Battel, D. and Szostak, J.W. (1993) Science 
261:1411-1418. 

Alternatively, hVR-1, hVR-2, and rVR-2 gene expression can be inhibited by 
targeting nucleotide sequences complementary to the regulatory region of the hVR-1 , 

1 0 hVR-2, and rVR-2 {e.g. , the hVR- 1 , hVR-2, and rVR-2 promoter and/or enhancers) to 
form triple helical structures that prevent transcription of the hVR-1, hVR-2, and rVR-2 
gene in target cells. See generally, Helene, C. (1991) Anticancer Drug Des. 6(6):569- 
84; Helene, C. et al. (1992) Ann. N. Y. Acad ScL 660:27-36; and Maher, L.J. (1992) 
Bioassays 14(12):807-15. 

15 In yet another embodiment, the hVR-1 , hVR-2, and rVR-2 nucleic acid 

molecules of the present invention can be modified at the base moiety, sugar moiety or 
phosphate backbone to improve, e.g., the stability, hybridization, or solubility of the 
molecule. For example, the deoxyribose phosphate backbone of the nucleic acid 
molecules can be modified to generate peptide nucleic acids (see Hyrup B. et al. (1996) 

20 Bioorganic & Medicinal Chemistry 4(1): 5-23). As used herein, the terms "peptide 
nucleic acids" or "PNAs" refer to nucleic acid mimics, e.g., DNA mimics, in which the 
deoxyribose phosphate backbone is replaced by a pseudopeptide backbone and only the 
four natural nucleobases are retained. The neutral backbone of PNAs has been shown to 
allow for specific hybridization to DNA and RNA under conditions of low ionic 

25 strength. The synthesis of PNA oligomers can be performed using standard solid phase 
peptide synthesis protocols as described in Hyrup B. et al. (1996) supra; Perry-O'Keefe 
etal Proc. Natl. Acad. Sci. 93: 14670-675. 

PNAs of hVR-1, hVR-2, and rVR-2 nucleic acid molecules can be used in 
therapeutic and diagnostic applications. For example, PNAs can be used as antisense or 

30 antigene agents for sequence-specific modulation of gene expression by, for example, 
inducing transcription or translation arrest or inhibiting replication. PNAs of hVR-1, 
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hVR-2, and rVR-2 nucleic acid molecules can also be used in the analysis of single base 
pair mutations in a gene, (e.g., by PNA-directed PCR clamping); as 'artificial restriction 
enzymes' when used in combination with other enzymes, (e.g., SI nucleases (Hyrup B. 
(1996) supra)); or as probes or primers for DNA sequencing or hybridization (Hyrup B. 
5 et al. ( 1 996) supra; Perry-O'Keefe supra). 

In another embodiment, PNAs of hVR-1, hVR-2, and rVR-2 can be modified, 
(e.g., to enhance their stability or cellular uptake), by attaching lipophilic or other helper 
groups to PNA, by the formation of PNA-DNA chimeras, or by the use of liposomes or 
other techniques of drug delivery known in the art. For example, PNA-DNA chimeras 

10 of hVR-1, hVR-2, and rVR-2 nucleic acid molecules can be generated which may 
combine the advantageous properties of PNA and DNA. Such chimeras allow DNA 
recognition enzymes, (e.g., RNAse H and DNA polymerases), to interact with the DNA 
portion while the PNA portion would provide high binding affinity and specificity. 
PNA-DNA chimeras can be linked using linkers of appropriate lengths selected in terms 

15 of base stacking, number of bonds between the nucleobases, and orientation (Hyrup B. 
(1996) supra). The synthesis of PNA-DNA chimeras can be performed as described in 
Hyrup B. (1996) supra and Finn P.J. et al. (1996) Nucleic Acids Res. 24 (17): 3357-63. 
For example, a DNA chain can be synthesized on a solid support using standard 
phosphoramidite coupling chemistry and modified nucleoside analogs, e.g., 5'-(4- 

20 methoxytrityl)amino-5'-deoxy-thymidine phosphoramidite, can be used as a between the 
PNA and the 5' end of DNA (Mag, M. et al. (1989) Nucleic Acid Res. 1 7: 5973-88). 
PNA monomers are then coupled in a stepwise manner to produce a chimeric molecule 
with a 5' PNA segment and a 3' DNA segment (Finn P.J. et al. (1996) supra). 
Alternatively, chimeric molecules can be synthesized with a 5 1 DNA segment and a 3' 

25 PNA segment (Peterser, K.H. et al. (1975) Bioorganic Med. Chem. Lett. 5: 1119-1 1 124). 

In other embodiments, the oligonucleotide may include other appended groups 
such as peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating 
transport across the cell membrane (see, e.g., Letsinger et al. (1989) Proc. Natl. Acad. 
Sci. USA 86:6553-6556; Lemaitre et al (1987) Proc. Natl Acad. ScL USA 84:648-652; 

30 PCT Publication No. W088/09810) or the blood-brain barrier (see, e.g, PCT Publication 
No. W089/10134). In addition, oligonucleotides can be modified with hybridization- 
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triggered cleavage agents (See, e.g., Krol et al. (1988) Bio-Techniques 6:958-976) or 
intercalating agents. (See, e.g., Zon (1988) Pharm. Res. 5:539-549). To this end, the 
oligonucleotide may be conjugated to another molecule, (e.g., a peptide, hybridization 
triggered cross-linking agent, transport agent, or hybridization-triggered cleavage agent). 

5 

II. Isolated hVR-1. hVR-2, and rVR-2 Proteins and Anti-hVR-1, Anti-hVR-2, and Anti- 
rVR-2 Antibodies 

One aspect of the invention pertains to isolated hVR- 1 , hVR-2, and rVR-2 
proteins, and biologically active portions thereof, as well as polypeptide fragments 

10 suitable for use as immunogens to raise anti-hVR-2, anti-hVR-2, and anti-rVR-2 
antibodies. In one embodiment, native hVR-1, hVR-2, and rVR-2 proteins can be 
isolated from cells or tissue sources by an appropriate purification scheme using 
standard protein purification techniques. In another embodiment, hVR-1, hVR-2, and 
rVR-2 proteins are produced by recombinant DNA techniques. Alternative to 

15 recombinant expression, an hVR-1, hVR-2, and rVR-2 protein or polypeptide can be 
synthesized chemically using standard peptide synthesis techniques. 

An "isolated" or "purified" protein or biologically active portion thereof is 
substantially free of cellular material or other contaminating proteins from the cell or 
tissue source from which the hVR-1, hVR-2, and rVR-2 protein is derived, or 

20 substantially free from chemical precursors or other chemicals when chemically 

synthesized. The language "substantially free of cellular material" includes preparations 
of hVR-1, hVR-2, and rVR-2 protein in which the protein is separated from cellular 
components of the cells from which it is isolated or recombinantly produced. In one 
embodiment, the language "substantially free of cellular material" includes preparations 

25 of hVR-1, hVR-2, and rVR-2 protein having less than about 30% (by dry weight) of 
non-hVR-1, hVR-2, and rVR-2 protein (also referred to herein as a "contaminating 
protein"), more preferably less than about 20% of non-hVR-1, hVR-2, and rVR-2 
protein, still more preferably less than about 10% of non-hVR-1, hVR-2, and rVR-2 
protein, and most preferably less than about 5% non-hVR-1, non-hVR-2, and non-rVR-2 

30 protein. When the hVR-1, hVR-2, and rVR-2 protein or biologically active portion 
thereof is recombinantly produced, it is also preferably substantially free of culture 
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medium, i.e., culture medium represents less than about 20%, more preferably less than 
about 10%, and most preferably less than about 5% of the volume of the protein 
preparation. 

The language "substantially free of chemical precursors or other chemicals" 
5 includes preparations of hVR-1, hVR-2, and rVR-2 protein in which the protein is 
separated from chemical precursors or other chemicals which are involved in the 
synthesis of the protein. In one embodiment, the language "substantially free of 
chemical precursors or other chemicals" includes preparations of hVR-1, hVR-2, and 
rVR-2 protein having less than about 30% (by dry weight) of chemical precursors or 
10 non-hVR-1, hVR-2, and rVR-2 chemicals, more preferably less than about 20% 

chemical precursors or non-hVR-1, hVR-2, and rVR-2 chemicals, still more preferably 
less than about 10% chemical precursors or non-hVR-1, hVR-2, and rVR-2 chemicals, 
and most preferably less than about 5% chemical precursors or non-hVR-1, hVR-2, and 
rVR-2 chemicals. 

15 As used herein, a "biologically active portion" of an hVR-1, hVR-2, and rVR-2 

protein includes a fragment of an hVR-1, hVR-2, and rVR-2 protein which participates 
in an interaction between an hVR-1, hVR-2, and rVR-2 molecule and a non-hVR-1, 
non-hVR-2, and non-rVR-2 molecule, respectively. Biologically active portions of an 
hVR-1, hVR-2, and rVR-2 protein include peptides comprising amino acid sequences 

20 sufficiently homologous to or derived from the amino acid sequence of the hVR-1, hVR- 
2, and rVR-2 protein, e.g., the amino acid sequence shown in SEQ ID NO:2, 5, 8, or 1 1, 
which include less amino acids than the full length hVR-1, hVR-2, and rVR-2 proteins, 
and exhibit at least one activity of an hVR-1, hVR-2, and rVR-2 protein. Typically, 
biologically active portions comprise a domain or motif with at least one activity of the 

25 hVR-1, hVR-2, and rVR-2 protein, e.g., binding of an hVR-1, hVR-2, and rVR-2 ligand 
such as a vanilloid compound, e.g., Capsaicin. A biologically active portion of an hVR- 
1, hVR-2, and rVR-2 protein can be a polypeptide which is, for example, 10, 20, 30, 40, 
50, 60, 70, 80, 90, 100, 200 or more amino acids in length. Biologically active portions 
of an hVR-1, hVR-2, and rVR-2 protein can be used as targets for developing agents 

30 which modulate an hVR-1, hVR-2, and rVR-2 mediated activity, e.g., a pain signaling 
mechanism. 
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In one embodiment, a biologically active portion of an hVR-I, hVR-2, and rVR- 
2 protein comprises at least one transmembrane domain, and/or at least one proline rich 
domain, and/or at least one ankyrin repeat domain. It is to be understood that a 
biologically active portion of an hVR-1, hVR-2, and rVR-2 protein of the present 
5 invention may contain at least one of the above-identified structural domains. A more 
biologically active portion of an hVR-1, hVR-2, and rVR-2 protein may contain at least 
two of the above-identified structural domains. Moreover, other biologically active 
portions, in which other regions of the protein are deleted, can be prepared by 
recombinant techniques and evaluated for one or more of the functional activities of a 
1 0 native hVR- 1 , hVR-2, and rVR-2 protein. 

In a embodiment, the hVR-1, hVR-2, and rVR-2 protein has an amino acid 
sequence shown in SEQ ID NO:2, 5, 8, or 1 1 . In other embodiments, the hVR-1, hVR- 
2. and rVR-2 protein is substantially homologous to SEQ ID NO:2, 5, 8, or 1 1, and 
retains the functional activity of the protein of SEQ ID NO:2, 5, 8, or 1 1 , yet differs in 
1 5 amino acid sequence due to natural allelic variation or mutagenesis, as described in 
detail in subsection I above. Accordingly, in another embodiment, the hVR-1, hVR-2, 
and rVR-2 protein is a protein which comprises an amino acid sequence at least about 
60%, 65%, 70%, 75%, 80%, 85%, 87%, 90%, 95%, 98% or more homologous to SEQ 
IDNO:2, 5, 8, or 11. 

20 To determine the percent identity of two amino acid sequences or of two nucleic 

acid sequences, the sequences are aligned for optimal comparison purposes (e.g., gaps 
can be introduced in one or both of a first and a second amino acid or nucleic acid 
sequence for optimal alignment and non-homologous sequences can be disregarded for 
comparison purposes). In a embodiment, the length of a reference sequence aligned for 

25 comparison purposes is at least 30%, preferably at least 40%, more preferably at least 

50%, even more preferably at least 60%, and even more preferably at least 70%, 80%, or 
90% of the length of the reference sequence (e.g., when aligning a second sequence to 
the hVR-1, hVR-2, and rVR-2 amino acid sequence of SEQ ID NO:2, 5, 8, or 1 1, having 
177 amino acid residues, at least 80, preferably at least 100, more preferably at least 120, 

30 even more preferably at least 140, and even more preferably at least 150, 160 or 170 
amino acid residues are aligned). The amino acid residues or nucleotides at 
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corresponding amino acid positions or nucleotide positions are then compared. When a 
position in the first sequence is occupied by the same amino acid residue or nucleotide 
as the corresponding position in the second sequence, then the molecules are identical at 
that position (as used herein amino acid or nucleic acid "identity" is equivalent to amino 
5 acid or nucleic acid "homology"). The percent identity between the two sequences is a 
function of the number of identical positions shared by the sequences, taking into 
account the number of gaps, and the length of each gap, which need to be introduced for 
optimal alignment of the two sequences. 

The comparison of sequences and determination of percent identity between two 

10 sequences can be accomplished using a mathematical algorithm. In a embodiment, the 
percent identity between two amino acid sequences is determined using the Needleman 
and Wunsch (./. Mol Biol (48):444-453 (1970)) algorithm which has been incorporated 
into the GAP program in the GCG software package (available at http://www.gcg.com), 
using either a Blossum 62 matrix or a PAM250 matrix, and a gap weight of 16, 14, 12, 

15 10, 8, 6, or 4 and a length weight of 1, 2, 3, 4, 5, or 6. In yet another embodiment, the 
percent identity between two nucleotide sequences is determined using the GAP 
program in the GCG software package (available at http://www.gcg.com), using a 
NWSgapdna.CMP matrix and a gap weight of 40, 50, 60, 70, or 80 and a length weight 
of 1, 2, 3, 4, 5, or 6. In another embodiment, the percent identity between two amino 

20 acid or nucleotide sequences is determined using the algorithm of E. Meyers and W. 

Miller (CABIOS, 4: 1 1-17 (1989)) which has been incorporated into the ALIGN program 
(version 2.0), using a PAM120 weight residue table, a gap length penalty of 12 and a 
gap penalty of 4. 

The nucleic acid and protein sequences of the present invention can further be 
25 used as a "query sequence" to perform a search against public databases to, for example, 
identify other family members or related sequences. Such searches can be performed 
using the NBLAST and XBLAST programs (version 2.0) of Altschul, et al (1990) J. 
Mol Biol 215:403-10. BLAST nucleotide searches can be performed with the 
NBLAST program, score = 100, wordlength = 12 to obtain nucleotide sequences 
30 homologous to hVR-1 , hVR-2, and rVR-2 nucleic acid molecules of the invention. 
BLAST protein searches can be performed with the XBLAST program, score = 50, 
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wordlength = 3 to obtain amino acid sequences homologous to hVR-1, hVR-2, and rVR- 
2 protein molecules of the invention. To obtain gapped alignments for comparison 
purposes. Gapped BLAST can be utilized as described in Altschul et aL. (1997) Nucleic 
Acids Res. 25(17):3389-3402. When utilizing BLAST and Gapped BLAST programs, 
5 the default parameters of the respective programs (e.g., XBLAST and NBLAST) can be 
used. See http://www.ncbi.nlm.nih.gov. 

The invention also provides hVR-1, hVR-2, and rVR-2 chimeric or fusion 
proteins. As used herein, an hVR-1, hVR-2, and rVR-2 "chimeric protein" or "fusion 
protein" comprises an hVR-1, hVR-2, and rVR-2 polypeptide operatively linked to a 

1 0 non-hVR- 1 , h VR-2, and rVR-2 polypeptide. An "h VR- 1 , hVR-2, and rVR-2 

polypeptide" refers to a polypeptide having an amino acid sequence corresponding to 
hVR-1, hVR-2, and rVR-2, whereas a "non-hVR-l, non-hVR-2, and non-rVR-2 
polypeptide" refers to a polypeptide having an amino acid sequence corresponding to a 
protein which is not substantially homologous to the hVR-1, hVR-2, and rVR-2 protein, 

15 e.g., a protein which is different from the hVR-1, hVR-2, and rVR-2 protein and which 
is derived from the same or a different organism. Within an hVR-1 , hVR-2, and rVR-2 
fusion protein the hVR-1, hVR-2, and rVR-2 polypeptide can correspond to all or a 
portion of an hVR-1, hVR-2, and rVR-2 protein. In a embodiment, an hVR-1, hVR-2, 
and rVR-2 fusion protein comprises at least one biologically active portion of an hVR-1, 

20 hVR-2, and rVR-2 protein. In another embodiment, an hVR-1, hVR-2, and rVR-2 

fusion protein comprises at least two biologically active portions of an hVR-1, hVR-2, 
and rVR-2 protein. Within the fusion protein, the term "operatively linked" is intended 
to indicate that the hVR-1, hVR-2, and rVR-2 polypeptide and the non-hVR-l, non- 
hVR-2, and non-rVR-2 polypeptide are fused in-frame to each other. The non-hVR-l, 

25 hVR-2, and rVR-2 polypeptide can be fused to the N-terminus or C-terminus of the 
hVR-1, hVR-2, and rVR-2 polypeptide. 

For example, in one embodiment, the fusion protein is a GST-hVR-1, GST- 
hVR-2, and GST-rVR-2 fusion protein in which the hVR-1, hVR-2, and rVR-2 
sequences are fused to the C-terminus of the GST sequences. Such fusion proteins can 

30 facilitate the purification of recombinant hVR-1 , hVR-2, and rVR-2. 
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In another embodiment, the fusion protein is an hVR-1, hVR-2, and rVR-2 
protein containing a heterologous signal sequence at its N-terminus. In certain host cells 
{e.g., mammalian host cells), expression and/or secretion of hVR-1, hVR-2, and rVR-2 
can be increased through use of a heterologous signal sequence. 
5 The hVR-1 , hVR-2, and rVR-2 fusion proteins of the invention can be 

incorporated into pharmaceutical compositions and administered to a subject in vivo. 
The hVR-1, hVR-2, and rVR-2 fusion proteins can be used to affect the bioavailability 
of an hVR-1, hVR-2, and rVR-2 substrate. Use of hVR-1, hVR-2, and rVR-2 fusion 
proteins may be useful therapeutically for the treatment of disorders caused by, for 

10 example, (i) aberrant modification or mutation of a gene encoding an hVR-1, hVR-2, 
and rVR-2 protein; (ii) mis-regulation of the hVR-1, hVR-2, and rVR-2 gene; and (iii) 
aberrant post-translational modification of an hVR-K hVR-2, and rVR-2 protein. 

Moreover, the hVR-1, hVR-2, and rVR-2-fusion proteins of the invention can be 
used as immunogens to produce anti-hVR-L anti-hVR-2, and anti-rVR-2 antibodies in a 

1 5 subject, to purify hVR-1, hVR-2, and rVR-2 ligands and in screening assays to identify 
molecules which inhibit the interaction of hVR-1, hVR-2, and rVR-2 with an hVR-1, 
hVR-2, and rVR-2 substrate. 

Preferably, an hVR-1, hVR-2, and rVR-2 chimeric or fusion protein of the 
invention is produced by standard recombinant DNA techniques. For example, DNA 

20 fragments coding for the different polypeptide sequences are ligated together in-frame in 
accordance with conventional techniques, for example by employing blunt-ended or 
stagger-ended termini for ligation, restriction enzyme digestion to provide for 
appropriate termini, filling-in of cohesive ends as appropriate, alkaline phosphatase 
treatment to avoid undesirable joining, and enzymatic ligation. In another embodiment, 

25 the fusion gene can be synthesized by conventional techniques including automated 
DNA synthesizers. Alternatively, PCR amplification of gene fragments can be carried 
out using anchor primers which give rise to complementary overhangs between two 
consecutive gene fragments which can subsequently be annealed and reamplified to 
generate a chimeric gene sequence (see, for example, Current Protocols in Molecular 

30 Biology, eds. Ausubel et al John Wiley & Sons: 1992). Moreover, many expression 
vectors are commercially available that already encode a fusion moiety {e.g., a GST 
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polypeptide). An hVR-1, hVR-2, and rVR-2-encoding nucleic acid can be cloned into 
such an expression vector such that the fusion moiety is linked in-frame to the hVR-1, 
hVR-2, and rVR-2 protein. 

The present invention also pertains to variants of the hVR-1 , hVR-2, and rVR-2 
5 proteins which function as either hVR-1, hVR-2, and rVR-2 agonists (mimetics) or as 
hVR-I, hVR-2, and rVR-2 antagonists. Variants of the hVR-1, hVR-2 ? and rVR-2 
proteins can be generated by mutagenesis, e.g., discrete point mutation or truncation of 
an hVR-1, hVR-2, and rVR-2 protein. An agonist of the hVR-1, hVR-2, and rVR-2 
proteins can retain substantially the same, or a subset, of the biological activities of the 

10 naturally occurring form of an hVR-1, hVR-2, and rVR-2 protein. An antagonist of an 
hVR-1, hVR-2, and rVR-2 protein can inhibit one or more of the activities of the 
naturally occurring form of the hVR-1, hVR-2, and rVR-2 protein by ? for example, 
competitively modulating an hVR-1, hVR-2, and rVR-2 -mediated activity of an hVR-1, 
hVR-2, and rVR-2 protein. Thus, specific biological effects can be elicited by treatment 

1 5 with a variant of limited function. In one embodiment, treatment of a subject with a 
variant having a subset of the biological activities of the naturally occurring form of the 
protein has fewer side effects in a subject relative to treatment with the naturally 
occurring form of the hVR-1, hVR-2, and rVR-2 protein. 

In one embodiment, variants of an hVR-1, hVR-2, and rVR-2 protein which 

20 function as either hVR-1, hVR-2, and rVR-2 agonists (mimetics) or as hVR-1, hVR-2, 
and rVR-2 antagonists can be identified by screening combinatorial libraries of mutants, 
e.g., truncation mutants, of an hVR-1, hVR-2, and rVR-2 protein for hVR-1, hVR-2, and 
rVR-2 protein agonist or antagonist activity. In one embodiment, a variegated library of 
hVR-1, hVR-2, and rVR-2 variants is generated by combinatorial mutagenesis at the 

25 nucleic acid level and is encoded by a variegated gene library. A variegated library of 
hVR-1, hVR-2, and rVR-2 variants can be produced by, for example, enzymatically 
Iigating a mixture of synthetic oligonucleotides into gene sequences such that a 
degenerate set of potential hVR-1, hVR-2, and rVR-2 sequences is expressible as 
individual polypeptides, or alternatively, as a set of larger fusion proteins {e.g., for phage 

30 display) containing the set of hVR-1, hVR-2, and rVR-2 sequences therein. There are a 
variety of methods which can be used to produce libraries of potential hVR-1, hVR-2, 



BNSDOCID: < WO 0029577 A 1 _(_> 



WO 00/29577 PCT/US99/26701 

-36- 

and rVR-2 variants from a degenerate oligonucleotide sequence. Chemical synthesis of 
a degenerate gene sequence can be performed in an automatic DNA synthesizer, and the 
synthetic gene then ligated into an appropriate expression vector. Use of a degenerate 
set of genes allows for the provision, in one mixture, of all of the sequences encoding 
5 the desired set of potential hVR-1 , hVR-2 ? and rVR-2 sequences. Methods for 

synthesizing degenerate oligonucleotides are known in the art (see, e.g., Narang, S.A. 
(1983) Tetrahedron 39:3; Itakura et al (1984) Annu. Rev. Biochem. 53:323; Itakurae/ 
al (1984) Science 198:1056; Ike et al (1983) Nucleic Acid Res. 11:477. 

In addition, libraries of fragments of an hVR-1, hVR-2, and rVR-2 protein 

10 coding sequence can be used to generate a variegated population of hVR-1, hVR-2, and 
rVR-2 fragments for screening and subsequent selection of variants of an hVR-1, hVR- 
2, and rVR-2 protein. In one embodiment, a library of coding sequence fragments can 
be generated by treating a double stranded PCR fragment of an hVR-1, hVR-2, and 
rVR-2 coding sequence with a nuclease under conditions wherein nicking occurs only 

15 about once per molecule, denaturing the double stranded DNA, renaturing the DNA to 
form double stranded DNA which can include sense/antisense pairs from different 
nicked products, removing single stranded portions from reformed duplexes by 
treatment with SI nuclease, and ligating the resulting fragment library into an expression 
vector. By this method, an expression library can be derived which encodes N-terminal, 

20 C-terminal and internal fragments of various sizes of the hVR-1, hVR-2, and rVR-2 
protein. 

Several techniques are known in the art for screening gene products of 
combinatorial libraries made by point mutations or truncation, and for screening cDNA 
libraries for gene products having a selected property. Such techniques are adaptable for 

25 rapid screening of the gene libraries generated by the combinatorial mutagenesis of 
hVR-1, hVR-2, and rVR-2 proteins. The most widely used techniques, which are 
amenable to high through-put analysis, for screening large gene libraries typically 
include cloning the gene library into replicable expression vectors, transforming 
appropriate cells with the resulting library of vectors, and expressing the combinatorial 

30 genes under conditions in which detection of a desired activity facilitates isolation of the 
vector encoding the gene whose product was detected. Recrusive ensemble mutagenesis 
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(REM), a new technique which enhances the frequency of functional mutants in the 
libraries, can be used in combination with the screening assays to identify hVR-1, hVR- 
2, and rVR-2 variants (Arkin and Yourvan (1992) Proc. Natl. Acad. Sci. USA £9:781 1- 
7815; Delgrave et al. (1993) Protein Engineering 6(3):327-331). 
5 In one embodiment, cell based assays can be exploited to analyze a variegated 

hVR-1, hVR-2, and rVR-2 library. For example, a library of expression vectors can be 
transfected into a cell line, e.g., a neuronal cell line, which ordinarily responds to a 
particular ligand in an hVR-1, hVR-2, and rVR-2-dependent manner. The transfected 
cells are then contacted with the ligand and the effect of expression of the mutant on 

10 signaling by the ligand can be detected, e.g., by measuring intracellular calcium 

concentration, neuronal membrane depolarization, or the activity of an hVR-1, hVR-2, 
and rVR-2-regulated transcription factor. Plasmid DNA can then be recovered from the 
cells which score for inhibition, or alternatively, potentiation of signaling by the ligand, 
and the individual clones further characterized. 

1 5 An isolated hVR-1 , hVR-2, and rVR-2 protein, or a portion or fragment thereof, 

can be used as an immunogen to generate antibodies that bind hVR-1, hVR-2, and rVR- 
2 using standard techniques for polyclonal and monoclonal antibody preparation. A 
full-length hVR-1, hVR-2, and rVR-2 protein can be used or, alternatively, the invention 
provides antigenic peptide fragments of hVR-1, hVR-2, and rVR-2 for use as 

20 immunogens. The antigenic peptide of hVR-1, hVR-2, and rVR-2 comprises at least 8 
amino acid residues of the amino acid sequence shown in SEQ ID NO:2, 5, 8, or 1 1 and 
encompasses an epitope of hVR-1, hVR-2, and rVR-2 such that an antibody raised 
against the peptide forms a specific immune complex with hVR-1, hVR-2, and rVR-2. 
Preferably, the antigenic peptide comprises at least 10 amino acid residues, more 

25 preferably at least 15 amino acid residues, even more preferably at least 20 amino acid 
residues, and most preferably at least 30 amino acid residues. 

Epitopes encompassed by the antigenic peptide are regions of hVR-1, hVR-2, 
and rVR-2 that are located on the surface of the protein, e.g., hydrophilic regions, as 
well as regions with high antigenicity (see, for example, Figures 12 and 14). 
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An hVR-1, hVR-2, and rVR-2 immunogen typically is used to prepare antibodies 
by immunizing a suitable subject, (e.g., rabbit, goat, mouse or other mammal) with the 
immunogen. An appropriate immunogenic preparation can contain, for example, 
recombinantly expressed hVR-1, hVR-2, and rVR-2 protein or a chemically synthesized 
5 hVR-1, hVR-2 ? and rVR-2 polypeptide. The preparation can further include an 
adjuvant, such as Freund's complete or incomplete adjuvant, or similar 
immunostimulatory agent. Immunization of a suitable subject with an immunogenic 
hVR-1, hVR-2, and rVR-2 preparation induces a polyclonal anti-hVR-1, anti-hVR-2, 
and anti-rVR-2 antibody response. 

10 Accordingly, another aspect of the invention pertains to anti-hVR-1, anti-hVR-2, 

and anti-rVR-2 antibodies. The term "antibody" as used herein refers to 
immunoglobulin molecules and immunologically active portions of immunoglobulin 
molecules, i.e., molecules that contain an antigen binding site which specifically binds 
(immunoreacts with) an antigen, such as hVR-1, hVR-2, and rVR-2. Examples of 

15 immunologically active portions of immunoglobulin molecules include F(ab) and 

F(ab')2 fragments which can be generated by treating the antibody with an enzyme such 
as pepsin. The invention provides polyclonal and monoclonal antibodies that bind hVR- 
1, hVR-2, and rVR-2. The term "monoclonal antibody" or "monoclonal antibody 
composition", as used herein, refers to a population of antibody molecules that contain 

20 only one species of an antigen binding site capable of immunoreacting with a particular 
epitope of hVR-1, hVR-2, and rVR-2. A monoclonal antibody composition thus 
typically displays a single binding affinity for a particular hVR-1, hVR-2, and rVR-2 
protein with which it immunoreacts. 

Polyclonal anti-hVR-1, anti-hVR-2, and anti-rVR-2 antibodies can be prepared 

25 as described above by immunizing a suitable subject with an hVR-1, hVR-2, and rVR-2 
immunogen. The anti-hVR-1, anti-hVR-2, and anti-rVR-2 antibody titer in the 
immunized subject can be monitored over time by standard techniques, such as with an 
enzyme linked immunosorbent assay (ELISA) using immobilized hVR-l ? hVR-2, and 
rVR-2. If desired, the antibody molecules directed against hVR-1, hVR-2, and rVR-2 

30 can be isolated from the mammal (e.g., from the blood) and further purified by well 

known techniques, such as protein A chromatography to obtain the IgG fraction. At an 
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appropriate time after immunization, e.g., when the anti-hVR-1, anti-hVR-2 ? and anti- 
rVR-2 antibody titers are highest, antibody-producing cells can be obtained from the 
subject and used to prepare monoclonal antibodies by standard techniques, such as the 
hybridoma technique originally described by Kohler and Milstein (1975) Nature 
5 256:495-497) (see also, Brown et al (1981) J. Immunol 127:539-46; Brown et al 
(1980) J. Biol Chem .255:4980-83; Yehetal. (1976) Proc. Natl Acad Sci. USA 
76:2927-3 1 ; and Yeh et al (1 982) Int. J. Cancer 29:269-75), the more recent human B 
cell hybridoma technique (Kozbor et al (1983) Immunol Today 4:72), the EBV- 
hybridoma technique (Cole et al (1985), Monoclonal Antibodies and Cancer Therapy, 

10 Alan R. Liss, Inc., pp. 77-96) or trioma techniques. The technology for producing 
monoclonal antibody hybridomas is well known (see generally R. H. Kenneth, in 
Monoclonal Antibodies: A New Dimension In Biological Analyses, Plenum Publishing 
Corp., New York, New York (1980); E. A. Lerner (1981) Yale J. Biol Med, 54:387402; 
M. L. Gefter et al (1977) Somatic Cell Genet. 3:23136). Briefly, an immortal cell line 

1 5 (typically a myeloma) is fused to lymphocytes (typically splenocytes) from a mammal 
immunized with an hVR-1, hVR-2, and rVR-2 immunogen as described above, and the 
culture supematants of the resulting hybridoma cells are screened to identify a 
hybridoma producing a monoclonal antibody that binds hVR-1, hVR-2, and rVR-2. 
Any of the many well known protocols used for fusing lymphocytes and 

20 immortalized cell lines can be applied for the purpose of generating an anti-hVR-1, anti- 
hVR-2, and anti-rVR-2 monoclonal antibodies (see, e.g., G. Galfre et al (1977) Nature 
266:55052; Gefter et al Somatic Cell Genet., cited supra; Lerner, Yale J. Biol Med., 
cited supra; Kenneth, Monoclonal Antibodies, cited supra). Moreover, the ordinarily 
skilled worker will appreciate that there are many variations of such methods which also 

25 would be useful. Typically, the immortal cell line (e.g., a myeloma cell line) is derived 
from the same mammalian species as the lymphocytes. For example, murine 
hybridomas can be made by fusing lymphocytes from a mouse immunized with an 
immunogenic preparation of the present invention with an immortalized mouse cell line, 
immortal cell lines are mouse myeloma cell lines that are sensitive to culture medium 

30 containing hypoxanthine, aminopterin and thymidine ("HAT medium"). Any of a 
number of myeloma cell lines can be used as a ftision partner according to standard 
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techniques, e.g., the P3-NSl/l-Ag4-l, P3-x63-Ag8.653 or Sp2/OAgl4 myeloma lines. 
These myeloma lines are available from ATCC. Typically, HAT-sensitive mouse 
myeloma cells are fused to mouse splenocytes using polyethylene glycol ("PEG"). 
Hybridoma cells resulting from the fusion are then selected using HAT medium, which 
5 kills unfused and unproductively fused myeloma cells (unfused splenocytes die after 
several days because they are not transformed). Hybridoma cells producing a 
monoclonal antibody of the invention are detected by screening the hybridoma culture 
supernatants for antibodies that bind hVR-1, hVR-2, and rVR-2, e.g., using a standard 
ELISA assay. 

10 Alternative to preparing monoclonal antibody-secreting hybridomas, a 

monoclonal anti-hVR-1, anti-hVR-2, and anti-rVR-2 antibody can be identified and 
isolated by screening a recombinant combinatorial immunoglobulin library {e.g., an 
antibody phage display library) with hVR-1, hVR-2, and rVR-2 to thereby isolate 
immunoglobulin library members that bind hVR-1, hVR-2, and rVR-2. Kits for 

15 generating and screening phage display libraries are commercially available {e.g., the 
Pharmacia Recombinant Phage Antibody System, Catalog No. 27-9400-01; and the 
Stratagene Sur/ZAP™ Phage Display Kit, Catalog No. 240612). Additionally, examples 
of methods and reagents particularly amenable for use in generating and screening 
antibody display library can be found in, for example, Ladner et al U.S. Patent No. 

20 5,223,409; Kang et al PCT International Publication No. WO 92/18619; Dower et al. 
PCT International Publication No. WO 91/17271; Winter et al. PCT International 
Publication WO 92/20791; Markland et al. PCT International Publication No. WO 
92/15679; Breitling et al PCT International Publication WO 93/01288; McCafferty et 
al PCT International Publication No. WO 92/01047; Garrard et al. PCT International 

25 Publication No. WO 92/09690; Ladner et al PCT International Publication No. WO 
90/02809; Fuchs et al (1991) Bio/Technology 9:1370-1372; Hay et al (1992) Hum. 
Antibod. Hybridomas 3:81-85; Huse et al. (1989) Science 246:1275-1281; Griffiths et al 
(1993) EMBO J 12:725-734; Hawkins et al (1992)7. Mol Biol 226:889-896; Clarkson 
et al (1991) Nature 352:624-628; Gram et al (1992) Proc. Natl Acad. Sci. USA 

30 89:3576-3580; Garrad et al (1991) Bio/Technology 9:1373-1377; Hoogenboom et al 
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(1991) Nuc. Acid Res. 19:4133-4137; Barbas et aL (1991) Proc. NatL Acad. ScL USA 
88:7978-7982; and McCafferty et aL Nature (1990) 348:552-554. 

Additionally, recombinant anti-hVR-1, anti-hVR-2, and anti-rVR-2 antibodies, 
such as chimeric and humanized monoclonal antibodies, comprising both human and 
5 non-human portions, which can be made using standard recombinant DNA techniques, 
are within the scope of the invention. Such chimeric and humanized monoclonal 
antibodies can be produced by recombinant DNA techniques known in the art, for 
example using methods described in Robinson et aL International Application No. 
PCT/US86/02269; Akira, et aL European Patent Application 184,187; Taniguchi, M., 

10 European Patent Application 171,496; Morrison et aL European Patent Application 
173,494; Neuberger et aL PCT International Publication No. WO 86/01533; Cabilly et 
aL U.S. Patent No. 4,816,567; Cabilly et aL European Patent Application 125,023; 
Better*?/ aL (1988) Science 240:1041-1043; UxxetaL (1987) Proc. NatL Acad. Sci. USA 
84:3439-3443; Liu et aL (1987) J. Immunol. 139:3521-3526; Sun et aL (1987) Proc. 

15 NatL Acad. ScL USA 84:214-218; Nishimura et aL (1987) Cane. Res. 47:999-1005; 
Wood et aL (1985) Nature 3 14:446-449; and Shaw et aL (1988) J. NatL Cancer Inst. 
80:1553-1559); Morrison, S. L. (1985) Science 229:1202-1207; Oi et aL (1986) 
BioTechniques 4:214; Winter U.S. Patent 5,225,539; Jones et aL (1986) Nature 
321:552-525; Verhoeyan et aL (1988) Science 239:1534; and Beidler et aL (1988) J. 

20 Immunol. 141:4053-4060. 

An anti-hVR-1, anti-hVR-2, and anti-rVR-2 antibody (e.g., monoclonal 
antibody) can be used to isolate hVR-1, hVR-2, and rVR-2 by standard techniques, such 
as affinity chromatography or immunoprecipitation. An anti-hVR-1, anti-hVR-2, and 
anti-rVR-2 antibody can facilitate the purification of natural hVR-1, hVR-2, and rVR-2 

25 from cells and of recombinantly produced hVR-1 , hVR-2, and rVR-2 expressed in host 
cells. Moreover, an anti-hVR-1 , anti-hVR-2, and anti-rVR-2 antibody can be used to 
detect hVR-1, hVR-2, and rVR-2 protein (e.g., in a cellular lysate or cell supernatant) in 
order to evaluate the abundance and pattern of expression of the hVR-1 , hVR-2, and 
rVR-2 protein. Anti-hVR-1, anti-hVR-2, and anti-rVR-2 antibodies can be used 

30 diagnostically to monitor protein levels in tissue as part of a clinical testing procedure, 
e.g., to, for example, determine the efficacy of a given treatment regimen. Detection can 



BNSDOCID: <WO 0Q29577A1 J_> 



WO 00/29577 PCT/US99/26701 

-42 - 

be facilitated by coupling (i.e., physically linking) the antibody to a detectable 
substance. Examples of detectable substances include various enzymes, prosthetic 
groups, fluorescent materials, luminescent materials, bioluminescent materials, and 
radioactive materials. Examples of suitable enzymes include horseradish peroxidase, 
5 alkaline phosphatase, -galactosidase, or acetylcholinesterase; examples of suitable 
prosthetic group complexes include streptavidin/biotin and avidin/biotin; examples of 
suitable fluorescent materials include umbelliferone, fluorescein, fluorescein 
isothiocyanate, rhodamine, dichlorotriazinylamine fluorescein, dansyl chloride or 
phycoerythrin; an example of a luminescent material includes luminol; examples of 
10 bioluminescent materials include luciferase, luciferin, and aequorin, and examples of 
suitable radioactive material include I25 I, 13 l l, 35 S or 3 H. 



III. Recombinant Expression Vectors and Host Cells 

Another aspect of the invention pertains to vectors, preferably expression 

1 5 vectors, containing a nucleic acid encoding an h VR- 1 , hVR-2, and rVR-2 protein (or a 
portion thereof)- As used herein, the term "vector" refers to a nucleic acid molecule 
capable of transporting another nucleic acid to which it has been linked. One type of 
vector is a "plasmid", which refers to a circular double stranded DNA loop into which 
additional DNA segments can be ligated. Another type of vector is a viral vector, 

20 wherein additional DNA segments can be ligated into the viral genome. Certain vectors 
are capable of autonomous replication in a host cell into which they are introduced (e.g., 
bacterial vectors having a bacterial origin of replication and episomal mammalian 
vectors). Other vectors (e.g., non-episomal mammalian vectors) are integrated into the 
genome of a host cell upon introduction into the host cell, and thereby are replicated 

25 along with the host genome. Moreover, certain vectors are capable of directing the 
expression of genes to which they are operatively linked. Such vectors are referred to 
herein as "expression vectors". In general, expression vectors of utility in recombinant 
DNA techniques are often in the form of plasmids. In the present specification, 
"plasmid" and "vector" can be used interchangeably as the plasmid is the most 

30 commonly used form of vector. However, the invention is intended to include such 
other forms of expression vectors, such as viral vectors (e.g., replication defective 
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retroviruses, adenoviruses and adeno-associated viruses), which serve equivalent 
functions. 

The recombinant expression vectors of the invention comprise a nucleic acid of 
the invention in a form suitable for expression of the nucleic acid in a host cell, which 
5 means that the recombinant expression vectors include one or more regulatory 

sequences, selected on the basis of the host cells to be used for expression, which is 
operatively linked to the nucleic acid sequence to be expressed. Within a recombinant 
expression vector, "operably linked" is intended to mean that the nucleotide sequence of 
interest is linked to the regulatory sequence(s) in a manner which allows for expression 
10 of the nucleotide sequence (e.g., in an in vitro transcription/translation system or in a 
host cell when the vector is introduced into the host cell). The term "regulatory 
sequence" is intended to include promoters, enhancers and other expression control 
elements (e.g., polyadenylation signals). Such regulatory sequences are described, for 
example, in Goeddel; Gene Expression Technology: Methods in Enzymology 185, 

1 5 Academic Press, San Diego, CA (1990). Regulatory sequences include those which 
direct constitutive expression of a nucleotide sequence in many types of host cells and 
those which direct expression of the nucleotide sequence only in certain host cells (e.g., 
tissue-specific regulatory sequences). It will be appreciated by those skilled in the art 
that the design of the expression vector can depend on such factors as the choice of the 

20 host cell to be transformed, the level of expression of protein desired, and the like. The 
expression vectors of the invention can be introduced into host cells to thereby produce 
proteins or peptides, including fusion proteins or peptides, encoded by nucleic acids as 
described herein (e.g., hVR-1, hVR-2, and rVR-2 proteins, mutant forms of hVR-1, 
hVR-2, and rVR-2 proteins, fusion proteins, and the like). 

25 The recombinant expression vectors of the invention can be designed for 

expression of hVR-1, hVR-2, and rVR-2 proteins in prokaryotic or eukaryotic cells. For 
example, hVR-1, hVR-2, and rVR-2 proteins can be expressed in bacterial cells such as 
E. colt, insect cells (using baculovirus expression vectors) yeast cells or mammalian 
cells. Suitable host cells are discussed further in Goeddel, Gene Expression Technology: 

30 Methods in Enzymology 185, Academic Press, San Diego, CA (1990). Alternatively, the 
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recombinant expression vector can be transcribed and translated in vitro, for example 
using T7 promoter regulatory sequences and T7 polymerase. 

Expression of proteins in prokaryotes is most often carried out in E. coli with 
vectors containing constitutive or inducible promoters directing the expression of either 
5 fusion or non-fusion proteins. Fusion vectors add a number of amino acids to a protein 
encoded therein, usually to the amino terminus of the recombinant protein. Such fusion 
vectors typically serve three purposes: 1) to increase expression of recombinant protein; 
2) to increase the solubility of the recombinant protein; and 3) to aid in the purification 
of the recombinant protein by acting as a ligand in affinity purification. Often, in fusion 
10 expression vectors, a proteolytic cleavage site is introduced at the junction of the fusion 
moiety and the recombinant protein to enable separation of the recombinant protein from 
the fusion moiety subsequent to purification of the fusion protein. Such enzymes, and 
their cognate recognition sequences, include Factor Xa, thrombin and enterokinase. 
Typical fusion expression vectors include pGEX (Pharmacia Biotech Inc; Smith, D.B. 
1 5 and Johnson, K.S. ( 1 988) Gene 67:3 1 -40), pMAL (New England Biolabs, Beverly, MA) 
and pRIT5 (Pharmacia, Piscataway, NJ) which fuse glutathione S-transferase (GST), 
maltose E binding protein, or protein A, respectively, to the target recombinant protein. 

Purified fusion proteins can be utilized in hVR-1, hVR-2, and rVR-2 activity 
assays, (e.g., direct assays or competitive assays described in detail below), or to, for 
20 example, generate antibodies specific for hVR-1, hVR-2, and rVR-2 proteins. In a 
embodiment, an hVR-1, hVR-2, and rVR-2 fusion protein expressed in a retroviral 
expression vector of the present invention can be utilized to infect bone marrow cells 
which are subsequently transplanted into irradiated recipients. The pathology of the 
subject recipient is then examined after sufficient time has passed (e.g., six (6) weeks). 
25 Examples of suitable inducible non-fusion E. coli expression vectors include 

pTrc (Amann et al, (1988) Gene 69:301-3 15) and pET 1 Id (Studier et aL 9 Gene 
Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, 
California (1990) 60-89). Target gene expression from the pTrc vector relies on host 
RNA polymerase transcription from a hybrid trp-lac fusion promoter. Target gene 
30 expression from the pET 1 Id vector relies on transcription from a T7 gnlO-lac fusion 
promoter mediated by a coexpressed viral RNA polymerase (T7 gnl). This viral 
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polymerase is supplied by host strains BL21(DE3) or HMS174(DE3) from a resident 
prophage harboring a T7 gnl gene under the transcriptional control of the lacUV 5 
promoter. 

One strategy to maximize recombinant protein expression in E. coli is to express 
5 the protein in a host bacteria with an impaired capacity to proteolytically cleave the 
recombinant protein (Gottesman, S., Gene Expression Technology: Methods in 
Enzymology 185, Academic Press, San Diego, California (1990) 1 19-128). Another 
strategy is to alter the nucleic acid sequence of the nucleic acid to be inserted into an 
expression vector so that the individual codons for each amino acid are those 
1 0 preferentially utilized in £. coli ( Wada et al , ( 1 992) Nucleic Acids Res, 20:2 1 1 1 -2 1 1 8). 
Such alteration of nucleic acid sequences of the invention can be carried out by standard 
DNA synthesis techniques. 

In another embodiment, the hVR-1, hVR-2, and rVR-2 expression vector is a 
yeast expression vector. Examples of vectors for expression in yeast S. cerivisae include 
1 5 pYepSecl (Baldari, et al, (1987) Embo J. 6:229-234), pMFa (Kurjan and Herskowitz, 

(1982) Cell 30:933-943), pJRY88 (Schultz et al, (1987) Gene 54:1 13-123), pYES2 
(Invitrogen Corporation, San Diego, CA), and picZ (InVitrogen Corp, San Diego, CA). 

Alternatively, hVR- 1 , hVR-2, and rVR-2 proteins can be expressed in insect 
cells using baculovirus expression vectors. Baculovirus vectors available for expression 
20 of proteins in cultured insect cells (e.g., Sf 9 cells) include the pAc series (Smith et al 

(1983) Mol Cell Biol 3:2156-2165) and the pVL series (Lucklow and Summers (1989) 
Virology 170:31-39). 

In yet another embodiment, a nucleic acid of the invention is expressed in 
mammalian cells using a mammalian expression vector. Examples of mammalian 

25 expression vectors include pCDM8 (Seed, B. (1987) Nature 329:840) and pMT2PC 
(Kaufman et al (1987) EMBO J. 6:187-195). When used in mammalian cells, the 
expression vector's control functions are often provided by viral regulatory elements. 
For example, commonly used promoters are derived from polyoma, Adenovirus 2, 
cytomegalovirus and Simian Virus 40. For other suitable expression systems for both 

30 prokaryotic and eukaryotic cells see chapters 16 and 17 of Sambrook, J., Fritsh, E. F., 
and Maniatis, T. Molecular Cloning: A Laboratory Manual 2nd, ed, Cold Spring 
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Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 
1989. 

In another embodiment, the recombinant mammalian expression vector is 
capable of directing expression of the nucleic acid preferentially in a particular cell type 
5 (e.g., tissue-specific regulatory elements are used to express the nucleic acid). Tissue- 
specific regulatory elements are known in the art. Non-limiting examples of suitable 
tissue-specific promoters include the albumin promoter (liver-specific; Pinkert et al 
(1987) Genes Dev. 1:268-277), lymphoid-specific promoters (Calame and Eaton (1988) 
Adv. Immunol 43:235-275), in particular promoters of T cell receptors (Winoto and 

10 Baltimore (1989) EMBO J. 8:729-733) and immunoglobulins (Banerji et al. (1983) Cell 
33:729-740; Queen and Baltimore (1983) Cell 33:741-748), neuron-specific promoters 
(e g the neurofilament promoter; Byrne and Ruddle (1989) Proc. Natl. Acad Sci. USA 
86:5473-5477), pancreas-specific promoters (Edlund et al (1985) Science 230:912-916), 
and mammary gland-specific promoters (e.g., milk whey promoter; U.S. Patent No. 

1 5 4,873,3 1 6 and European Application Publication No. 264,1 66). Developmentally- 
regulated promoters are also encompassed, for example the murine hox promoters 
(Kessel and Gruss (1990) Science 249:374-379) and the a-fetoprotein promoter 
(Campes and Tilghman (1989) Genes Dev. 3:537-546). 

The expression characteristics of an endogenous hVR-1, hVR-2, and rVR-2 gene 

20 within a cell line or microorganism may be modified by inserting a heterologous DNA 
regulatory element into the genome of a stable cell line or cloned microorganism such 
that the inserted regulatory element is operatively linked with the endogenous hVR-1, 
hVR-2, and rVR-2 gene. For example, an endogenous hVR-1, hVR-2, and rVR-2 gene 
which is normally "trancriptionally silent", i.e., a hVR-1, hVR-2, and rVR-2 gene which 

25 is normally not expressed, or is expressed only at very low levels in a cell line or 

microorganism, may be activated by inserting a regulatory element which is capable of 
promoting the expression of a normally expressed gene product in that cell line or 
microorganism. Alternatively, a transcriptionally silent, endogenous hVR-1, hVR-2, 
and rVR-2 gene, may be activated by insertion of a promiscuous regulatory element that 

30 works across cell types. 
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A heterologous regulatory element may be inserted into a stable cell line or 
cloned microorganism, such that it is operatively linked with an endogenous hVR-1, 
hVR-2, and rVR-2 gene, using techniques, such as targeted homologous recombination, 
which are well known to those of skill in the art, and described e.g., in Chappel, U.S. 
5 Patent No.: 5,272,071; PCT publication No. WO 91/06667, published May 16, 1991. 

The invention further provides a recombinant expression vector comprising a 
DNA molecule of the invention cloned into the expression vector in an antisense 
orientation. That is, the DNA molecule is operatively linked to a regulatory sequence in 
a manner which allows for expression (by transcription of the DNA molecule) of an 

10 RNA molecule which is antisense to hVR-1, hVR-2, and rVR-2 mRNA. Regulatory 
sequences operatively linked to a nucleic acid cloned in the antisense orientation can be 
chosen which direct the continuous expression of the antisense RNA molecule in a 
variety of cell types, for instance viral promoters and/or enhancers, or regulatory 
sequences can be chosen which direct constitutive, tissue specific or cell type specific) 

1 5 expression of antisense RNA. The antisense expression vector can be in the form of a 
recombinant plasmid, phagemid or attenuated virus in which antisense nucleic acids are 
produced under the control of a high efficiency regulatory region, the activity of which 
can be determined by the cell type into which the vector is introduced. For a discussion 
of the regulation of gene expression using antisense genes see Weintraub, H. et al. , 

20 Antisense RNA as a molecular tool for genetic analysis, Reviews - Trends in Genetics, 
Vol. 1(1) 1986. 

Another aspect of the invention pertains to host cells into which an hVR- 1 , h VR- 
2, and rVR-2 nucleic acid molecule of the invention is introduced, e.g., an hVR-1, hVR- 
2, and rVR-2 nucleic acid molecule within a recombinant expression vector or an hVR- 

25 1, hVR-2, and rVR-2 nucleic acid molecule containing sequences which allow it to 

homologously recombine into a specific site of the host cell's genome. The terms "host 
cell" and "recombinant host cell" are used interchangeably herein. It is understood that 
such terms refer not only to the particular subject cell but to the progeny or potential 
progeny of such a cell. Because certain modifications may occur in succeeding 

30 generations due to either mutation or environmental influences, such progeny may not, 
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in fact, be identical to the parent cell, but are still included within the scope of the term 
as used herein. 

A host cell can be any prokaryotic or eukaryotic cell. For example, an hVR-1 , 
hVR-2, and rVR-2 protein can be expressed in bacterial cells such as E. coli, insect cells, 
5 yeast or mammalian cells (such as Chinese hamster ovary cells (CHO) or COS cells). 
Other suitable host cells are known to those skilled in the art. 

Vector DNA can be introduced into prokaryotic or eukaryotic cells via 
conventional transformation or transfection techniques. As used herein, the terms 
"transformation" and "transfection" are intended to refer to a variety of art-recognized 
1 0 techniques for introducing foreign nucleic acid (e.g., DNA) into a host cell, including 
calcium phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated 
transfection, lipofection, or electroporation. Suitable methods for transforming or 
transfecting host cells can be found in Sambrook, et ai (Molecular Cloning: A 
Laboratory Manual 2nd, ed, Cold Spring Harbor Laboratory, Cold Spring Harbor 
1 5 Laboratory Press, Cold Spring Harbor, NY, 1989), and other laboratory manuals. 

For stable transfection of mammalian cells, it is known that, depending upon the 
expression vector and transfection technique used, only a small fraction of cells may 
integrate the foreign DNA into their genome. In order to identify and select these 
integrants, a gene that encodes a selectable marker (e.g., resistance to antibiotics) is 
20 generally introduced into the host cells along with the gene of interest, selectable 

markers include those which confer resistance to drugs, such as G418, hygromycin and 
methotrexate. Nucleic acid encoding a selectable marker can be introduced into a host 
cell on the same vector as that encoding an hVR-1 , hVR-2, and rVR-2 protein or can be 
introduced on a separate vector. Cells stably transfected with the introduced nucleic 
25 acid can be identified by drug selection (e.g., cells that have incorporated the selectable 
marker gene will survive, while the other cells die). 

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in 
culture, can be used to produce (Le. 7 express) an hVR-1, hVR-2, and rVR-2 protein. 
Accordingly, the invention further provides methods for producing an hVR-1, hVR-2, 
30 and rVR-2 protein using the host cells of the invention. In one embodiment, the method 
comprises culturing the host cell of invention (into which a recombinant expression 
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vector encoding an hVR-1, hVR-2, and rVR-2 protein has been introduced) in a suitable 
medium such that an hVR-1, hVR-2, and rVR-2 protein is produced. In another 
embodiment, the method further comprises isolating an hVR-1, hVR-2, and rVR-2 
protein from the medium or the host cell. 
5 The host cells of the invention can also be used to produce non-human transgenic 

animals. For example, in one embodiment, a host cell of the invention is a fertilized 
oocyte or an embryonic stem cell into which hVR-1, hVR-2, and rVR-2-coding 
sequences have been introduced. Such host cells can then be used to create non-human 
transgenic animals in which exogenous hVR-1, hVR-2, and rVR-2 sequences have been 

10 introduced into their genome or homologous recombinant animals in which endogenous 
hVR-1 , hVR-2, and rVR-2 sequences have been altered. Such animals are useful for 
studying the function and/or activity of an hVR-1, hVR-2, and rVR-2 and for identifying 
and/or evaluating modulators of hVR-1, hVR-2, and rVR-2 activity. As used herein, a 
"transgenic animal" is a non-human animal, preferably a mammal, more preferably a 

15 rodent such as a rat or mouse, in which one or more of the cells of the animal includes a 
transgene. Other examples of transgenic animals include non-human primates, sheep, 
dogs, cows, goats, chickens, amphibians, and the like. A transgene is exogenous DNA 
which is integrated into the genome of a cell from which a transgenic animal develops 
and which remains in the genome of the mature animal, thereby directing the expression 

20 of an encoded gene product in one or more cell types or tissues of the transgenic animal. 
As used herein, a "homologous recombinant animal" is a non-human animal, preferably 
a mammal, more preferably a mouse, in which an endogenous hVR-1, hVR-2, and rVR- 
2 gene has been altered by homologous recombination between the endogenous gene 
and an exogenous DNA molecule introduced into a cell of the animal, e.g., an 

25 embryonic cell of the animal, prior to development of the animal. 

A transgenic animal of the invention can be created by introducing an hVR-1, 
hVR-2, and rVR-2 -encoding nucleic acid into the male pronuclei of a fertilized oocyte, 
e.g., by microinjection, retroviral infection, and allowing the oocyte to develop in a 
pseudopregnant female foster animal. The hVR-1, hVR-2, and rVR-2 cDNA sequence 

30 of SEQ ID NO: 1 , 3, 5, 7 or 9 can be introduced as a transgene into the genome of a non- 
human animal. Alternatively, a nonhuman homologue of a hVR-2 gene, such as a 
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mouse or rat hVR-2, e.g., the rVR-2 gene, can be used as a transgene. Alternatively, an 
hVR-1, hVR-2, and rVR-2 gene homologue, such as another member of the 
Capsaicin/Vanilloid family, can be isolated based on hybridization to the hVR-1, hVR-2, 
and rVR-2 cDNA sequences of SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12, (described further 
5 in subsection I above) and used as a transgene. Intronic sequences and polyadenylation 
signals can also be included in the transgene to increase the efficiency of expression of 
the transgene. A tissue-specific regulatory sequence(s) can be operably linked to an 
hVR-1, hVR-2, and rVR-2 transgene to direct expression of an hVR-L hVR-2, and rVR- 
2 protein to particular cells. Methods for generating transgenic animals via embryo 

10 manipulation and microinjection, particularly animals such as mice, have become 

conventional in the art and are described, for example, in U.S. Patent Nos. 4,736,866 and 
4,870,009, both by Leder et al, U.S. Patent No. 4,873,191 by Wagner et al and in 
Hogan, B., Manipulating the Mouse Embryo, (Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, N.Y., 1986). Similar methods are used for production of other 

1 5 transgenic animals. A transgenic founder animal can be identified based upon the 

presence of an hVR-1, hVR-2, and rVR-2 transgene in its genome and/or expression of 
hVR-1, hVR-2, and rVR-2 mRNA in tissues or cells of the animals. A transgenic 
founder animal can then be used to breed additional animals carrying the transgene. 
Moreover, transgenic animals carrying a transgene encoding an hVR-1, hVR-2, and 

20 rVR-2 protein can further be bred to other transgenic animals carrying other transgenes. 

To create a homologous recombinant animal, a vector is prepared which contains 
at least a portion of an hVR-1, hVR-2, and rVR-2 gene into which a deletion, addition or 
substitution has been introduced to thereby alter, e.g., functionally disrupt, the hVR-1, 
hVR-2, and rVR-2 gene. The VR-1 or VR-2 gene can be a human gene (e.g., the cDNA 

25 of SEQ ID NO: 1 , 3, 5, 4, 6, 7, or 9), but more preferably, is a non-human homologue of 
a hVR-1 and hVR-2 gene (e.g., the cDNA of SEQ ID NO: 10 or 12, or a cDNA isolated 
by stringent hybridization with the nucleotide sequence of SEQ ID NO: 1, 3, 5, 4, 6, 7, 
or 9). For example, a mouse VR-2 gene can be used to construct a homologous 
recombination nucleic acid molecule, e.g., a vector, suitable for altering an endogenous 

30 VR-2 gene in the mouse genome. In a embodiment, the homologous recombination 
nucleic acid molecule is designed such that, upon homologous recombination, the 
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endogenous hVR-1, hVR-2, and rVR-2 gene is functionally disrupted (i.e. y no longer 
encodes a functional protein; also referred to as a "knock out" vector). Alternatively, the 
homologous recombination nucleic acid molecule can be designed such that, upon 
homologous recombination, the endogenous hVR- 1 , hVR-2, and rVR-2 gene is mutated 
5 or otherwise altered but still encodes functional protein (e.g., the upstream regulatory 
region can be altered to thereby alter the expression of the endogenous hVR-1, hVR-2, 
and rVR-2 protein). In the homologous recombination nucleic acid molecule, the altered 
portion of the hVR-1, hVR-2, and rVR-2 gene is flanked at its 5' and 3' ends by 
additional nucleic acid sequence of the hVR-1, hVR-2, and rVR-2 gene to allow for 
1 0 homologous recombination to occur between the exogenous h VR- 1 , hVR-2, and rVR-2 
gene carried by the homologous recombination nucleic acid molecule and an 
endogenous hVR-1, hVR-2, and rVR-2 gene in a cell, e.g., an embryonic stem cell. The 
additional flanking hVR-1, hVR-2, and rVR-2 nucleic acid sequence is of sufficient 
length for successful homologous recombination with the endogenous gene. Typically, 
15 several kilobases of flanking DNA (both at the 5' and 3 1 ends) are included in the 
homologous recombination nucleic acid molecule (see, e.g., Thomas, K.R. and 
Capecchi, M. R. (1987) Cell 51 :503 for a description of homologous recombination 
vectors). The homologous recombination nucleic acid molecule is introduced into a cell, 
e.g., an embryonic stem cell line (e.g., by electroporation) and cells in which the 
20 introduced hVR-1, hVR-2, and rVR-2 gene has homologously recombined with the 
endogenous hVR-1, hVR-2, and rVR-2 gene are selected (see e.g., Li, E. et al. (1992) 
Cell 69:915). The selected cells can then injected into a blastocyst of an animal (e.g., a 
mouse) to form aggregation chimeras (see e.g., Bradley, A. in Teratocarcinomas and 
Embryonic Stem Cells: A Practical Approach, EJ. Robertson, ed. (IRL, Oxford, 1987) 
25 pp. 1 13-152). A chimeric embryo can then be implanted into a suitable pseudopregnant 
female foster animal and the embryo brought to term. Progeny harboring the 
homologously recombined DNA in their genu cells can be used to breed animals in 
which all cells of the animal contain the homologously recombined DNA by germline 
transmission of the transgene. Methods for constructing homologous recombination 
30 nucleic acid molecules, e.g., vectors, or homologous recombinant animals are described 
further in Bradley, A. (1991) Current Opinion in Biotechnology 2:823-829 and in PCT 
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International Publication Nos.: WO 90/1 1354 by Le Mouellec et al; WO 91/01 140 by 
Smithies et al ; WO 92/0968 by Zijlstra et al ; and WO 93/04 1 69 by Berns et al 

In another embodiment, transgenic non-humans animals can be produced which 
contain selected systems which allow for regulated expression of the transgene. One 
5 example of such a system is the cre/loxP recombinase system of bacteriophage PI . For 
a description of the cre/loxP recombinase system, see, e.g., Lakso et al (1992) Proc. 
Natl Acad. Sci. USA 89:6232-6236. Another example of a recombinase system is the 
FLP recombinase system of Saccharomyces cerevisiae (O'Gorman et al (1991) Science 
251:1351-1355. If a cre/loxP recombinase system is used to regulate expression of the 
1 0 transgene, animals containing transgenes encoding both the Cre recombinase and a 

selected protein are required. Such animals can be provided through the construction of 
"double 11 transgenic animals, e.g., by mating two transgenic animals, one containing a 
transgene encoding a selected protein and the other containing a transgene encoding a 
recombinase. 

1 5 Clones of the non-human transgenic animals described herein can also be 

produced according to the methods described in Wilmut, I. et al (1997) Nature 385:810- 
813 and PCT International Publication Nos. WO 97/07668 and WO 97/07669. In brief, 
a cell, e.g., a somatic cell, from the transgenic animal can be isolated and induced to exit 
the growth cycle and enter G Q phase. The quiescent cell can then be fused, e.g., through 

20 the use of electrical pulses, to an enucleated oocyte from an animal of the same species 
from which the quiescent cell is isolated. The recontructed oocyte is then cultured such 
that it develops to morula or blastocyte and then transferred to pseudopregnant female 
foster animal. The offspring borne of this female foster animal will be a clone of the 
animal from which the cell, e.g., the somatic cell, is isolated. 

25 

IV. Pharmaceutical Compositions 

The hVR-1, hVR-2, and rVR-2 nucleic acid molecules, fragments of hVR-1, 
hVR-2, and rVR-2 proteins, and anti-hVR-1, anti-hVR-2, and anti-rVR-2 antibodies 
(also referred to herein as "active compounds") of the invention can be incorporated into 
30 pharmaceutical compositions suitable for administration. Such compositions typically 
comprise the nucleic acid molecule, protein, or antibody and a pharmaceutically 
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acceptable carrier. As used herein the language "pharmaceutically acceptable carrier" is 
intended to include any and all solvents, dispersion media, coatings, antibacterial and 
antifungal agents, isotonic and absorption delaying agents, and the like, compatible with 
pharmaceutical administration. The use of such media and agents for pharmaceutically 
5 active substances is well known in the art. Except insofar as any conventional media or 
agent is incompatible with the active compound, use thereof in the compositions is 
contemplated- Supplementary active compounds can also be incorporated into the 
compositions. 

A pharmaceutical composition of the invention is formulated to be compatible 
1 0 with its intended route of administration. Examples of routes of administration include 
parenteral, e.g., intravenous, intradermal, subcutaneous, oral (e.g., inhalation), 
transdermal (topical), transmucosal, and rectal administration. Solutions or suspensions 
used for parenteral, intradermal, or subcutaneous application can include the following 
components: a sterile diluent such as water for injection, saline solution, fixed oils, 
1 5 polyethylene glycols, glycerine, propylene glycol or other synthetic solvents; 

antibacterial agents such as benzyl alcohol or methyl parabens; antioxidants such as 
ascorbic acid or sodium bisulfite; chelating agents such as ethylenediaminetetraacetic 
acid; buffers such as acetates, citrates or phosphates and agents for the adjustment of 
tonicity such as sodium chloride or dextrose. pH can be adjusted with acids or bases, 
20 such as hydrochloric acid or sodium hydroxide. The parenteral preparation can be 
enclosed in ampoules, disposable syringes or multiple dose vials made of glass or 
plastic. 

Pharmaceutical compositions suitable for injectable use include sterile aqueous 
solutions (where water soluble) or dispersions and sterile powders for the 

25 extemporaneous preparation of sterile injectable solutions or dispersion. For 

intravenous administration, suitable carriers include physiological saline, bacteriostatic 
water, Cremophor EL™ (BASF, Parsippany, NJ) or phosphate buffered saline (PBS). In 
all cases, the composition must be sterile and should be fluid to the extent that easy 
syringability exists. It must be stable under the conditions of manufacture and storage 

30 and must be preserved against the contaminating action of microorganisms such as 
bacteria and fungi. The carrier can be a solvent or dispersion medium containing, for 
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example, water, ethanol, polyol (for example, glycerol, propylene glycol, and liquid 
polyetheylene glycol, and the like), and suitable mixtures thereof. The proper fluidity 
can be maintained, for example, by the use of a coating such as lecithin, by the 
maintenance of the required particle size in the case of dispersion and by the use of 
5 surfactants. Prevention of the action of microorganisms can be achieved by various 
antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol, 
ascorbic acid, thimerosal, and the like. In many cases, it will be preferable to include 
isotonic agents, for example, sugars, polyalcohols such as manitol, sorbitol, sodium 
chloride in the composition. Prolonged absorption of the injectable compositions can be 
10 brought about by including in the composition an agent which delays absorption, for 
example, aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by incorporating the active 
compound (e.g., a fragment of an hVR-1, hVR-2, and rVR-2 protein or an anti-hVR-1, 
anti-hVR-2, and anti-rVR-2 antibody) in the required amount in an appropriate solvent 

15 with one or a combination of ingredients enumerated above, as required, followed by 
filtered sterilization. Generally, dispersions are prepared by incorporating the active 
compound into a sterile vehicle which contains a basic dispersion medium and the 
required other ingredients from those enumerated above. In the case of sterile powders 
for the preparation of sterile injectable solutions, the methods of preparation are vacuum 

20 drying and freeze-drying which yields a powder of the active ingredient plus any 
additional desired ingredient from a previously sterile-filtered solution thereof. 

Oral compositions generally include an inert diluent or an edible carrier. They 
can be enclosed in gelatin capsules or compressed into tablets. For the purpose of oral 
therapeutic administration, the active compound can be incorporated with excipients and 

25 used in the form of tablets, troches, or capsules, oral compositions can also be prepared 
using a fluid carrier for use as a mouthwash, wherein the compound in the fluid carrier is 
applied orally and swished and expectorated or swallowed. Pharmaceutically 
compatible binding agents, and/or adjuvant materials can be included as part of the 
composition. The tablets, pills, capsules, troches and the like can contain any of the 

30 following ingredients, or compounds of a similar nature: a binder such as 

microcrystalline cellulose, gum tragacanth or gelatin; an excipient such as starch or 
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lactose, a disintegrating agent such as alginic acid, Primogel, or corn starch; a lubricant 
such as magnesium stearate or Sterotes; a glidant such as colloidal silicon dioxide; a 
sweetening agent such as sucrose or saccharin; or a flavoring agent such as peppermint, 
methyl salicylate, or orange flavoring. 
5 For administration by inhalation, the compounds are delivered in the form of an 

aerosol spray from pressured container or dispenser which contains a suitable propellant, 
e.g., a gas such as carbon dioxide, or a nebulizer. 

Systemic administration can also be by transmucosal or transdermal means. For 
transmucosal or transdermal administration, penetrants appropriate to the barrier to be 

10 permeated are used in the formulation. Such penetrants are generally known in the art, 
and include, for example, for transmucosal administration, detergents, bile salts, and 
fusidic acid derivatives. Transmucosal administration can be accomplished through the 
use of nasal sprays or suppositories. For transdermal administration, the active 
compounds are formulated into ointments, salves, gels, or creams as generally known in 

15 the art. 

The compounds can also be prepared in the form of suppositories (e.g., with 
conventional suppository bases such as cocoa butter and other glycerides) or retention 
enemas for rectal delivery. 

In one embodiment, the active compounds are prepared with carriers that will 

20 protect the compound against rapid elimination from the body, such as a controlled 
release formulation, including implants and microencapsulated delivery systems. 
Biodegradable, biocompatible polymers can be used, such as ethylene vinyl acetate, 
polyanhydrides, polyglycolic acid, collagen, polyorthoesters, and polylactic acid. 
Methods for preparation of such formulations will be apparent to those skilled in the art. 

25 The materials can also be obtained commercially from Alza Corporation and Nova 

Pharmaceuticals, Inc. Liposomal suspensions (including liposomes targeted to infected 
cells with monoclonal antibodies to viral antigens) can also be used as pharmaceutical^ 
acceptable carriers. These can be prepared according to methods known to those skilled 
in the art, for example, as described in U.S. Patent No. 4,522,81 1 . 
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It is especially advantageous to formulate oral or parenteral compositions in 
dosage unit form for ease of administration and uniformity of dosage. Dosage unit form 
as used herein refers to physically discrete units suited as unitary dosages for the subject 
to be treated; each unit containing a predetermined quantity of active compound 
5 calculated to produce the desired therapeutic effect in association with the required 
pharmaceutical carrier. The specification for the dosage unit forms of the invention are 
dictated by and directly dependent on the unique characteristics of the active compound 
and the particular therapeutic effect to be achieved, and the limitations inherent in the art 
of compounding such an active compound for the treatment of individuals. 
1 0 Toxicity and therapeutic efficacy of such compounds can be determined by 

standard pharmaceutical procedures in cell cultures or experimental animals, e.g., for 
determining the LD50 (the dose lethal to 50% of the population) and the ED50 (the dose 
therapeutically effective in 50% of the population). The dose ratio between toxic and 
therapeutic effects is the therapeutic index and it can be expressed as the ratio 
15 LD50/ED50. Compounds which exhibit large therapeutic indices are preferred. While 
compounds that exhibit toxic side effects may be used, care should be taken to design a 
delivery system that targets such compounds to the site of affected tissue in order to 
minimize potential damage to uninfected cells and, thereby, reduce side effects. 

The data obtained from the cell culture assays and animal studies can be used in 
20 formulating a range of dosage for use in humans. The dosage of such compounds lies 
preferably within a range of circulating concentrations that include the ED50 with little 
or no toxicity. The dosage may vary within this range depending upon the dosage form 
employed and the route of administration utilized. For any compound used in the 
method of the invention, the therapeutically effective dose can be estimated initially 
25 from cell culture assays. A dose may be formulated in animal models to achieve a 

circulating plasma concentration range that includes the IC50 (i.e., the concentration of 
the test compound which achieves a half-maximal inhibition of symptoms) as 
determined in cell culture. Such information can be used to more accurately determine 
useful doses in humans. Levels in plasma may be measured, for example, by high 
30 performance liquid chromatography. 
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As defined herein, a therapeutically effective amount of protein or polypeptide 
(i.e., an effective dosage) ranges from about 0.001 to 30 mg/kg body weight, preferably 
about 0.01 to 25 mg/kg body weight, more preferably about 0.1 to 20 mg/kg body 
weight, and even more preferably about 1 to 10 mg/kg, 2 to 9 mg/kg, 3 to 8 mg/kg, 4 to 
5 7 mg/kg, or 5 to 6 mg/kg body weight. The skilled artisan will appreciate that certain 
factors may influence the dosage required to effectively treat a subject, including but not 
limited to the severity of the disease or disorder, previous treatments, the general health 
and/or age of the subject, and other diseases present. Moreover, treatment of a subject 
with a therapeutically effective amount of a protein, polypeptide, or antibody can 

10 include a single treatment or, preferably, can include a series of treatments. In a 

preferred example, a subject is treated with antibody, protein, or polypeptide in the range 
of between about 0. 1 to 20 mg/kg body weight, one time per week for between about 1 
to 1 0 weeks, preferably between 2 to 8 weeks, more preferably between about 3 to 7 
weeks, and even more preferably for about 4, 5, or 6 weeks. It will also be appreciated 

15 that the effective dosage of antibody, protein, or polypeptide used for treatment may 
increase or decrease over the course of a particular treatment. Changes in dosage may 
result and become apparent from the results of diagnostic assays as described herein. 

The present invention encompasses agents which modulate expression or 
activity. An agent may, for example, be a small molecule. For example, such small 

20 molecules include, but are not limited to, peptides, peptidomimetics, amino acids, amino 
acid analogs, polynucleotides, polynucleotide analogs, nucleotides, nucleotide analogs, 
organic or inorganic compounds (i.e,. including heteroorganic and organometallic 
compounds) having a molecular weight less than about 10,000 grams per mole, organic 
or inorganic compounds having a molecular weight less than about 5,000 grams per 

25 mole, organic or inorganic compounds having a molecular weight less than about 1 ,000 
grams per mole, organic or inorganic compounds having a molecular weight less than 
about 500 grams per mole, and salts, esters, and other pharmaceutically acceptable forms 
of such compounds. 

It is understood that appropriate doses of small molecule agents depends upon a 
30 number of factors within the ken of the ordinarily skilled physician, veterinarian, or 
researcher. The dose(s) of the small molecule will vary, for example, depending upon 



BNSDOCID: <WO 0029577A1_I_> 



WO 00/29577 PCT/US99/26701 

-58- 

the identity, size, and condition of the subject or sample being treated, further depending 
upon the route by which the composition is to be administered, if applicable, and the 
effect which the practitioner desires the small molecule to have upon the nucleic acid or 
polypeptide of the invention. 
5 Exemplary doses include milligram or microgram amounts of the small molecule 

per kilogram of subject or sample weight (e.g., about 1 microgram per kilogram to about 
500 milligrams per kilogram, about 100 micrograms per kilogram to about 5 milligrams 
per kilogram, or about 1 microgram per kilogram to about 50 micrograms per kilogram. 
It is furthermore understood that appropriate doses of a small molecule depend upon the 

1 0 potency of the small molecule with respect to the expression or activity to be modulated. 
Such appropriate doses may be determined using the assays described herein. 

When one or more of these small molecules is to be administered to an animal 
{e.g. , a human) in order to modulate expression or activity of a polypeptide or nucleic 
acid of the invention, a physician, veterinarian, or researcher may, for example, 

1 5 prescribe a relatively low dose at first, subsequently increasing the dose until an 

appropriate response is obtained. In addition, it is understood that the specific dose level 
for any particular animal subject will depend upon a variety of factors including the 
activity of the specific compound employed, the age, body weight, general health, 
gender, and diet of the subject, the time of administration, the route of administration, 

20 the rate of excretion, any drug combination, and the degree of expression or activity to 
be modulated. 

The nucleic acid molecules of the invention can be inserted into vectors and used 
as gene therapy vectors. Gene therapy vectors can be delivered to a subject by, for 
example, intravenous injection, local administration (see U.S. Patent 5,328,470) or by 

25 stereotactic injection (see e.g., Chen et al (1994) Proc. Natl. Acad. Set USA 91 :3054- 
3057). The pharmaceutical preparation of the gene therapy vector can include the gene 
therapy vector in an acceptable diluent, or can comprise a slow release matrix in which 
the gene delivery vehicle is imbedded. Alternatively, where the complete gene delivery 
vector can be produced intact from recombinant cells, e.g., retroviral vectors, the 

30 pharmaceutical preparation can include one or more cells which produce the gene 
delivery system. 
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The pharmaceutical compositions can be included in a container, pack, or 
dispenser together with instructions for administration. 

V. Uses and Methods of the Invention 
5 The nucleic acid molecules, proteins, protein homologues, and antibodies 

described herein can be used in one or more of the following methods: a) screening 
assays; b) predictive medicine (e.g., diagnostic assays, prognostic assays, monitoring 
clinical trials, and pharmacogenetics); and c) methods of treatment (e.g., therapeutic and 
prophylactic). As described herein, an hVR-1, hVR-2, and rVR-2 protein of the 

10 invention has one or more of the following activities: (1 ) it interacts with a non-hVR-1, 
non-hVR-2, and non-rVR-2 protein molecule, e.g., a vanilloid compound such as 
capsaicin; (2) it modulates intracellular calcium concentration; (3) it activates an hVR- 
1, hVR-2, and rVR-2-dependent signal transduction pathway; and (4) it modulates a pain 
signaling mechanism, and, thus, can be used to, for example, (1) modulate the 

15 interaction with a non-hVR-1, non-hVR-2, and non-rVR-2 protein molecule; (2) 

modulate intracellular calcium concentration; (3) activate an hVR-1, hVR-2, and rVR- 
2-dependent signal transduction pathway; and (4) modulate a pain signaling mechanism. 

The isolated nucleic acid molecules of the invention can be used, for example, to 
express hVR-1, hVR-2, and rVR-2 protein (e.g., via a recombinant expression vector in 

20 a host cell in gene therapy applications), to detect hVR-1, hVR-2, and rVR-2 mRNA 
(e.g., in a biological sample) or a genetic alteration in an hVR-1, hVR-2, and rVR-2 
gene, and to modulate hVR-1, hVR-2, and rVR-2 activity, as described further below. 
The hVR-1, hVR-2, and rVR-2 proteins can be used to screen for naturally occurring 
hVR-1, hVR-2, and rVR-2 substrates, to screen for drugs or compounds which modulate 

25 hVR-1, hVR-2, and rVR-2 activity, as well as to treat disorders characterized by 

insufficient or excessive production of hVR-1, hVR-2, and rVR-2 protein or production 
of hVR-1, hVR-2, and rVR-2 protein forms which have decreased or aberrant activity 
compared to hVR-1, hVR-2, and rVR-2 wild type protein (e.g., pain disorders). 
Moreover, the anti-hVR-1, anti-hVR-2, and anti-rVR-2 antibodies of the invention can 

30 be used to detect and isolate hVR- 1 , hVR-2, and rVR-2 proteins, regulate the 
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bioavailability of hVR-1, hVR-2, and rVR-2 proteins, arid modulate hVR-1, hVR-2, and 
rVR-2 activity. 



A. Screening Assays : 
5 The invention provides a method (also referred to herein as a "screening assay") 

for identifying modulators, i.e., candidate or test compounds or agents (e.g., peptides, 
peptidomimetics, small molecules or other drugs) which bind to hVR-1, hVR-2, and 
rVR-2 proteins, have a stimulatory or inhibitory effect on, for example, hVR-1, hVR-2, 
and rVR-2 expression or hVR-1, hVR-2, and rVR-2 activity, or have a stimulatory or 
10 inhibitory effect on, for example, the expression or activity of hVR-1, hVR-2, and rVR- 
2 substrate. 

In one embodiment, the invention provides assays for screening candidate or test 
compounds which are substrates of an hVR-1, hVR-2, and rVR-2 protein or polypeptide 
or biologically active portion thereof. In another embodiment, the invention provides 

1 5 assays for screening candidate or test compounds which bind to or modulate the activity 
of an hVR-1, hVR-2, and rVR-2 protein or polypeptide or biologically active portion 
thereof. The test compounds of the present invention can be obtained using any of the 
numerous approaches in combinatorial library methods known in the art, including: 
biological libraries; spatially addressable parallel solid phase or solution phase libraries; 

20 synthetic library methods requiring deconvolution; the 'one-bead one-compound' library 
method; and synthetic library methods using affinity chromatography selection. The 
biological library approach is limited to peptide libraries, while the other four 
approaches are applicable to peptide, non-peptide oligomer or small molecule libraries 
of compounds (Lam, K.S. (1997) Anticancer Drug Des. 12:145). 

25 Examples of methods for the synthesis of molecular libraries can be found in the 

art, for example in: DeWitt et al (1993) Proc. Natl Acad. ScL U.S.A. 90:6909; Erb et 
al (1994) Proc. Natl Acad. Set USA 91:1 1422; Zuckermann et al. (1994). J. Med. 
Chem. 37:2678; Cho et al (1993) Science 261:1303; Carrell et al (1994) Angew. Chem. 
Int. Ed. Engl 33:2059; Carell et al (1994) Angew. Chem. Int. Ed Engl 33:2061; and in 

30 Gallop et al (1 994) J. Med. Chem. 37: 1233. 
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Libraries of compounds may be presented in solution (e.g., Houghten (1992) 
Biotechniques 13:412-421), or on beads (Lam (1991) Nature 354:82-84), chips (Fodor 
(1993) Nature 364:555-556), bacteria (Ladner USP 5,223,409), spores (Ladner USP 
•409), plasmids (Cull et al (1992) Proc Natl Acad Sci USA 89:1865-1869) or on phage 
5 (Scott and Smith (1990) Science 249:386-390); (Devlin (1990) Science 249:404-406); 
(Cwirla et al (1990) Proc. Natl Acad Sci. 87:6378-6382); (Felici (1991) J. Mol Biol 
222:301-310); (Ladner supra.). 

In one embodiment, an assay is a cell-based assay in which a cell, e.g., a 
neuronal cell, which expresses an hVR-1, hVR-2, and rVR-2 protein or biologically 

1 0 active portion thereof is contacted with a test compound and the ability of the test 

compound to modulate hVR-1, hVR-2, and rVR-2 activity is determined. Determining 
the ability of the test compound to modulate hVR-1, hVR-2, and rVR-2 activity can be 
accomplished by monitoring, for example, intracellular calcium concentration or 
membrane depolarization by, e.g., patch-clamp recordings in whole-cell, inside-out, and 

15 outside-out configurations (as described in, for example, Tominaga M. et al (1998) 

Neuron 21 :53 1-543). Determining the ability of the test compound to modulate hVR-1, 
hVR-2, and rVR-2 activity can further be accomplished by monitoring the activity of an 
hVR-1, hVR-2, and rVR-2 -regulated transcription factor. The cell, for example, can be 
of mammalian origin, e.g., a neuronal cell. 

20 The ability of the test compound to modulate hVR-1, hVR-2, and rVR-2 binding 

to a substrate or to bind to hVR- 1 , hVR-2, and rVR-2 can also be determined. 
Determining the ability of the test compound to modulate hVR-1, hVR-2, and rVR-2 
binding to a substrate can be accomplished, for example, by coupling the hVR-1, hVR- 
2, and rVR-2 substrate with a radioisotope or enzymatic label such that binding of the 

25 hVR-1, hVR-2, and rVR-2 substrate to hVR-1, hVR-2, and rVR-2 can be determined by 
detecting the labeled hVR-1, hVR-2, and rVR-2 substrate in a complex. Determining 
the ability of the test compound to bind hVR-1, hVR-2, and rVR-2 can be accomplished, 
for example, by coupling the compound with a radioisotope or enzymatic label such that 
binding of the compound to hVR-1, hVR-2, and rVR-2 can be determined by detecting 

30 the labeled hVR-1, hVR-2, and rVR-2 compound in a complex. For example, 

compounds (e.g., hVR-1, hVR-2, and rVR-2 substrates) can be labeled with 125 I, 35 S, 
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14 C, or 3 H, either directly or indirectly, and the radioisotope detected by direct counting 
of radioemmission or by scintillation counting. Alternatively, compounds can be 
enzymatically labeled with, for example, horseradish peroxidase, alkaline phosphatase, 
or luciferase, and the enzymatic label detected by determination of conversion of an 
5 appropriate substrate to product. 

It is also within the scope of this invention to determine the ability of a 
compound (e.g., an hVR-1, hVR-2, and rVR-2 substrate) to interact with hVR-1, hVR-2, 
and rVR-2 without the labeling of any of the interactants. For example, a 
microphysiometer can be used to detect the interaction of a compound with hVR-1, 

10 hVR-2, and rVR-2 without the labeling of either the compound or the hVR-1, hVR-2, 
andrVR-2. McConnell, H. M. et al. (1992) Science 257:1906-1912. As used herein, a 
"microphysiometer" (e.g., Cytosensor) is an analytical instrument that measures the rate 
at which a cell acidifies its environment using a light-addressable potentiometric sensor 
(LAPS). Changes in this acidification rate can be used as an indicator of the interaction 

1 5 between a compound and hVR-1 , hVR-2, and rVR-2. 

In yet another embodiment, an assay of the present invention is a cell-free assay 
in which an hVRrl, hVR-2, and rVR-2 protein or biologically active portion thereof is 
contacted with a test compound and the ability of the test compound to bind to the hVR- 
1, hVR-2, and rVR-2 protein or biologically active portion thereof is determined. 

20 biologically active portions of the hVR-1, hVR-2, and rVR-2 proteins to be used in 

assays of the present invention include fragments which participate in interactions with 
non-hVR-1, non-hVR-2, and non-rVR-2 molecules, e.g., fragments with high surface 
probability scores. Binding of the test compound to the hVR-1, hVR-2, and rVR-2 
protein can be determined either directly or indirectly as described above. In a 

25 embodiment, the assay includes contacting the hVR-1, hVR-2, and rVR-2 protein or 

biologically active portion thereof with a known compound which binds hVR-1 , hVR-2, 
and rVR-2 to form an assay mixture, contacting the assay mixture with a test compound, 
and determining the ability of the test compound to interact with an hVR-1, hVR-2, and 
rVR-2 protein, wherein determining the ability of the test compound to interact with an 

30 hVR-1, hVR-2, and rVR-2 protein comprises determining the ability of the test 
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compound to preferentially bind to hVR-1, hVR-2, and rVR-2 or biologically active 
portion thereof as compared to the known compound. 

In another embodiment, the assay is a cell-free assay in which an hVR-1. hVR-2, 
and rVR-2 protein or biologically active portion thereof is contacted with a test 
5 compound and the ability of the test compound to modulate (e.g., stimulate or inhibit) 
the activity of the hVR-1 , hVR-2, and rVR-2 protein or biologically active portion 
thereof is determined. Determining the ability of the test compound to modulate the 
activity of an hVR-1, hVR-2, and rVR-2 protein can be accomplished, for example, by 
determining the ability of the hVR-1, hVR-2, and rVR-2 protein to bind to an hVR-1, 
1 0 hVR-2, and rVR-2 target molecule, e.g. , a vanilloid compound such as capsaicin, by one 
of the methods described above for determining direct binding. Determining the ability 
of the hVR-1, hVR-2, and rVR-2 protein to bind to an hVR-1, hVR-2, and rVR-2 target 
molecule can also be accomplished using a technology such as real-time Biomolecular 
Interaction Analysis (BIA). Sjolander, S. and Urbaniczky, C. (1991) Anal. Chem. 
15 63:2338-2345 and Szabo et al. (1995) Curr. Opin. Struct. Biol. 5:699-705. As used 
herein, "BIA" is a technology for studying biospecific interactions in real time, without 
labeling any of the interactants (e.g., BIAcore). Changes in the optical phenomenon of 
surface plasmon resonance (SPR) can be used as an indication of real-time reactions 

between biological molecules. 

20 In an alternative embodiment, determining the ability of the test compound to 

modulate the activity of an hVR-1, hVR-2, and rVR-2 protein can be accomplished by 
determining the ability of the hVR-1, hVR-2, and rVR-2 protein to further modulate the 
activity of a downstream effector of an hVR-1, hVR-2, and rVR-2 target molecule. For 
example, the activity of the effector molecule on an appropriate target can be determined 

25 or the binding of the effector to an appropriate target can be determined as previously 
described. 

In yet another embodiment, the cell-free assay involves contacting an hVR-1, 
hVR-2, and rVR-2 protein or biologically active portion thereof with a known 
compound which binds the hVR-1, hVR-2, and rVR-2 protein to form an assay mixture, 
30 contacting the assay mixture with a test compound, and determining the ability of the 
test compound to interact with the hVR-1, hVR-2, and rVR-2 protein, wherein 
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determining the ability of the test compound to interact with the hVR-1, hVR-2, and 
rVR-2 protein comprises determining the ability of the hVR-1, hVR-2, and rVR-2 
protein to preferentially bind to or modulate the activity of an hVR-1, hVR-2, and rVR-2 
target molecule. 

5 The cell-free assays of the present invention are amenable to use of both soluble 

and/or membrane-bound forms of isolated proteins (e.g., hVR-1 , hVR-2, and rVR-2 
proteins or biologically active portions thereof). In the case of cell-free assays in which 
a membrane-bound form of an isolated protein is used it may be desirable to utilize a 
solubilizing agent such that the membrane-bound form of the isolated protein is 

10 maintained in solution. Examples of such solubilizing agents include non-ionic 

detergents such as n-octylglucoside, n-dodecylglucoside, n-dodecylmaltoside, octanoyl- 
N-methylglucamide, decanoyl-N-methylglucamide, Triton® X-100, Triton® X-l 14, 
Thesit®, Isotridecypoly(ethylene glycol ether) n , 3-[(3- 
cholamidopropyl)dimethylamminio]-l -propane sulfonate (CHAPS), 3-[(3- 

1 5 cholamidopropyl)dimethylamminio]-2-hydroxy-l -propane sulfonate (CHAPSO), or N- 
dodecyl=N,N-dimethyl-3-ammonio- 1 -propane sulfonate. 

In more than one embodiment of the above assay methods of the present 
invention, it may be desirable to immobilize either hVR-1, hVR-2, and rVR-2 or its 
target molecule to facilitate separation of complexed from uncomplexed forms of one or 

20 both of the proteins, as well as to accommodate automation of the assay. Binding of a 
test compound to an hVR-1, hVR-2, and rVR-2 protein, or interaction of an hVR-1, 
hVR-2, and rVR-2 protein with a target molecule in the presence and absence of a 
candidate compound, can be accomplished in any vessel suitable for containing the 
reactants. Examples of such vessels include microtitre plates, test tubes, and micro- 

25 centrifuge tubes. In one embodiment, a fusion protein can be provided which adds a 
domain that allows one or both of the proteins to be bound to a matrix. For example, 
glutathione-S-transferase/ hVR-1, hVR-2, and rVR-2 fusion proteins or glutathione-S- 
transferase/target fusion proteins can be adsorbed onto glutathione sepharose beads 
(Sigma Chemical, St. Louis, MO) or glutathione derivatized microtitre plates, which are 
30 then combined with the test compound or the test compound and either the non-adsorbed 
target protein or hVR-1, hVR-2, and rVR-2 protein, and the mixture incubated under 
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conditions conducive to complex formation {e.g., at physiological conditions for salt and 
P H). Following incubation, the beads or microtitre plate wells are washed to remove 
any unbound components, the matrix immobilized in the case of beads, complex 
determined either directly or indirectly, for example, as described above. Alternatively, 
5 the complexes can be dissociated from the matrix, and the level of hVR-1, hVR-2, and 
rVR-2 binding or activity determined using standard techniques. 

Other techniques for immobilizing proteins on matrices can also be used in the 
screening assays of the invention. For example, either an hVR-1, hVR-2, and rVR-2 
protein or an hVR-1, hVR-2, and rVR-2 target molecule can be immobilized utilizing 
10 conjugation of biotin and streptavidin. Biotinylated hVR-1, hVR-2, and rVR-2 protein 
or target molecules can be prepared from biotin-NHS (N-hydroxy-succinimide) using 
techniques known in the art (e.g., biotinylation kit, Pierce Chemicals, Rockford, IL), and 
immobilized in the wells of streptavidin-coated 96 well plates (Pierce Chemical). 
Alternatively, antibodies reactive with hVR- 1 , hVR-2, and rVR-2 protein or target 
15 molecules but which do not interfere with binding of the hVR-1, hVR-2, and rVR-2 
protein to its target molecule can be derivatized to the wells of the plate, and unbound 
target or hVR-1, hVR-2, and rVR-2 protein trapped in the wells by antibody 
conjugation. Methods for detecting such complexes, in addition to those described 
above for the GST-immobilized complexes, include immunodetection of complexes 
20 using antibodies reactive with the hVR-1, hVR-2, and rVR-2 protein or target molecule, 
as well as enzyme-linked assays which rely on detecting an enzymatic activity 
associated with the hVR-1, hVR-2, and rVR-2 protein or target molecule. 

In another embodiment, modulators of hVR-1, hVR-2, and rVR-2 expression are 
identified in a method wherein a cell is contacted with a candidate compound and the 
25 expression of hVR-1, hVR-2, and rVR-2 mRNA or protein in the cell is determined. 
The level of expression of hVR-1, hVR-2, and rVR-2 mRNA or protein in the presence 
of the candidate compound is compared to the level of expression of hVR-1, hVR-2, and 
rVR-2 mRNA or protein in the absence of the candidate compound. The candidate 
compound can then be identified as a modulator of hVR-1, hVR-2, and rVR-2 
30 expression based on this comparison. For example, when expression of hVR-1 , hVR-2, 
and rVR-2 mRNA or protein is greater (statistically significantly greater) in the presence 
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of the candidate compound than in its absence, the candidate compound is identified as a 
stimulator of hVR-1, hVR-2, and rVR-2 mRNA or protein expression. Alternatively, 
when expression of hVR-1 , hVR-2, and rVR-2 mRNA or protein is less (statistically 
significantly less) in the presence of the candidate compound than in its absence, the 
5 candidate compound is identified as an inhibitor of hVR-1, hVR-2, and rVR-2 mRNA or 
protein expression. The level of hVR-1, hVR-2, and rVR-2 mRNA or protein 
expression in the cells can be determined by methods described herein for detecting 
hVR-1, hVR-2, and rVR-2 mRNA or protein. 

'in yet another aspect of the invention, the hVR-1, hVR-2, and rVR-2 proteins 
10 can be used as "bait proteins" in a two-hybrid assay or three-hybrid assay (see, e.g., U.S. 
Patent No. 5,283,317; Zervos et al. (1993) Cell 72:223-232; Madura et al. (1993) J. 
Biol. Chem. 268:12046-12054; Bartel et al. (1993) Biotechniques 14:920-924; Iwabuchi 
et al. (1993) Oncogene 8:1693-1696; and Brent WO94/10300), to identify other 
proteins, which bind to or interact with hVR-1, hVR-2, and rVR-2 ("hVR-1 -binding 
15 proteins", "hVR-2-binding proteins", and "rVR-2-binding proteins" or "hVR-l-bp", 
"hVR-2-bp", and "rVR-2-bp") and are involved in hVR-1, hVR-2, and rVR-2 activity. 
Such hVR-1 , hVR-2, and rVR-2-binding proteins are also likely to be involved in the 
propagation of signals by the hVR-1, hVR-2, and rVR-2 proteins or hVR-1, hVR-2, and 
rVR-2 targets as, for example, downstream elements of an hVR-1, hVR-2, and rVR-2- 
20 mediated signaling pathway, e.g., a pain signaling pathway. Alternatively, such hVR-1, 
hVR-2, and rVR-2-binding proteins are likely to be hVR-1, hVR-2, and rVR-2 
inhibitors. 

The two-hybrid system is based on the modular nature of most transcription 
factors, which consist of separable DNA-binding and activation domains. Briefly, the 
25 assay utilizes two different DNA constructs. In one construct, the gene that codes for an 
hVR-1 , hVR-2, and rVR-2 protein is fused to a gene encoding the DNA binding domain 
of a known transcription factor {e.g., GAL-4). In the other construct, a DNA sequence, 
from a library of DNA sequences, that encodes an unidentified protein ("prey" or 
"sample") is fused to a gene that codes for the activation domain of the known 
30 transcription factor. If the "bait" and the "prey" proteins are able to interact, in vivo, 
forming an hVR-1, hVR-2, and rVR-2-dependent complex, the DNA-binding and 
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activation domains of the transcription factor are brought into close proximity. This 
proximity allows transcription of a reporter gene {e.g. , LacZ) which is operably linked to 
a transcriptional regulatory site responsive to the transcription factor. Expression of the 
reporter gene can be detected and cell colonies containing the functional transcription 
5 factor can be isolated and used to obtain the cloned gene which encodes the protein 
which interacts with the hVR-1, hVR-2, and rVR-2 protein. 

This invention further pertains to novel agents identified by the above-described 
screening assays. Accordingly, it is within the scope of this invention to further use an 
agent identified as described herein in an appropriate animal model. For example, an 
10 agent identified as described herein {e.g., an hVR-1, hVR-2, and rVR-2 modulating 

agent, an antisense hVR-1, hVR-2, and rVR-2 nucleic acid molecule, an hVR-1, hVR-2, 
and rVR-2-specific antibody, or an hVR-1, hVR-2, and rVR-2-binding partner) can be 
used in an animal model to determine the efficacy, toxicity, or side effects of treatment 
with such an agent. Alternatively, an agent identified as described herein can be used in 
1 5 an animal model to determine the mechanism of action of such an agent. Furthermore, 
this invention pertains to uses of novel agents identified by the above-described 
screening assays for treatments as described herein. 

B. Detection Assays 

20 Portions or fragments of the cDN A sequences identified herein (and the 

corresponding complete gene sequences) can be used in numerous ways as 
polynucleotide reagents. For example, these sequences can be used to: (i) map their 
respective genes on a chromosome; and, thus, locate gene regions associated with 
genetic disease; (ii) identify an individual from a minute biological sample (tissue 

25 typing); and (iii) aid in forensic identification of a biological sample. These applications 
are described in the subsections below. 

1 . Chromosome Mapping 

Once the sequence (or a portion of the sequence) of a gene has been isolated, this 
30 sequence can be used to map the location of the gene on a chromosome. This process is 
called chromosome mapping. Accordingly, portions or fragments of the hVR-1 , hVR-2, 
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and rVR-2 nucleotide sequences, described herein, can be used to map the location of 
the hVR-1 , hVR-2, and rVR-2 genes on a chromosome. The mapping of the hVR-1, 
hVR-2, and rVR-2 sequences to chromosomes is an important first step in correlating 
these sequences with genes associated with disease. 
5 Briefly, hVR-1, hVR-2, and rVR-2 genes can be mapped to chromosomes by 

preparing PGR primers (preferably 15-25 bp in length) from the hVR-1, hVR-2, and 
rVR-2 nucleotide sequences. Computer analysis of the hVR-1, hVR-2, and rVR-2 
sequences can be used to predict primers that do not span more than one exon in the 
genomic DNA, thus complicating the amplification process. These primers can then be 

10 used for PCR screening of somatic cell hybrids containing individual human 

chromosomes. Only those hybrids containing the human gene corresponding to the 
hVR-L hVR-2, and rVR-2 sequences will yield an amplified fragment. 

Somatic cell hybrids are prepared by fusing somatic cells from different 
mammals {e.g., human and mouse cells). As hybrids of human and mouse cells grow 

15 and divide, they gradually lose human chromosomes in random order, but retain the 

mouse chromosomes. By using media in which mouse cells cannot grow, because they 
lack a particular enzyme, but human cells can, the one human chromosome that contains 
the gene encoding the needed enzyme, will be retained. By using various media, panels 
of hybrid cell lines can be established. Each cell line in a panel contains either a single 

20 human chromosome or a small number of human chromosomes, and a full set of mouse 
chromosomes, allowing easy mapping of individual genes to specific human 
chromosomes. (D f Eustachio P. et al (1983) Science 220:919-924). Somatic cell 
hybrids containing only fragments of human chromosomes can also be produced by 
using human chromosomes with translocations and deletions. 

25 PCR mapping of somatic cell hybrids is a rapid procedure for assigning a 

particular sequence to a particular chromosome. Three or more sequences can be 
assigned per day using a single thermal cycler. Using the hVR-1, hVR-2, and rVR-2 
nucleotide sequences to design oligonucleotide primers, sublocalization can be achieved 
with panels of fragments from specific chromosomes. Other mapping strategies which 

30 can similarly be used to map an hVR-1, hVR-2, and rVR-2 sequence to its chromosome 
include in situ hybridization (described in Fan, Y. et al. (1990) Proc. NatL Acad. Sci. 
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USA 87 r :6223-27), pre-screening with labeled flow-sorted chromosomes, and pre- 
selection by hybridization to chromosome specific cDNA libraries. 

Fluorescence in situ hybridization (FISH) of a DNA sequence to a metaphase 
chromosomal spread can further be used to provide a precise chromosomal location in 
5 one step. Chromosome spreads can be made using cells whose division has been 

blocked in metaphase by a chemical such as colcemid that disrupts the mitotic spindle. 
The chromosomes can be treated briefly with trypsin, and then stained with Giemsa. A 
pattern of light and dark bands develops on each chromosome, so that the chromosomes 
can be identified individually. The FISH technique can be used with a DNA sequence 
1 0 as short as 500 or 600 bases. However, clones larger than 1 ,000 bases have a higher 
likelihood of binding to a unique chromosomal location with sufficient signal intensity 
for simple detection. Preferably 1 ,000 bases, and more preferably 2,000 bases will 
suffice to get good results at a reasonable amount of time. For a review of this 
technique, see Verma et al. , Human Chromosomes: A Manual of Basic Techniques 
1 5 (Pergamon Press, New York 1 988). 

Reagents for chromosome mapping can be used individually to mark a single 
chromosome or a single site on that chromosome, or panels of reagents can be used for 
marking multiple sites and/or multiple chromosomes. Reagents corresponding to 
noncoding regions of the genes actually are for mapping purposes. Coding sequences 
20 are more likely to be conserved within gene families, thus increasing the chance of cross 
hybridizations during chromosomal mapping. 

Once a sequence has been mapped to a precise chromosomal location, the 
physical position of the sequence on the chromosome can be correlated with genetic map 
data. (Such data are found, for example, in V. McKusick, Mendelian Inheritance in 
25 Man, available on-line through Johns Hopkins University Welch Medical Library). The 
relationship between a gene and a disease, mapped to the same chromosomal region, can 
then be identified through linkage analysis (co-inheritance of physically adjacent genes), 
described in, for example, Egeland, J. et al. (1987) Nature, 325:783-787. 

Moreover, differences in the DNA sequences between individuals affected and 
30 unaffected with a disease associated with the hVR-1, hVR-2, and rVR-2 gene, can be 
determined. If a mutation is observed in some or all of the affected individuals but not 
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in any unaffected individuals, then the mutation is likely to be the causative agent of the 
particular disease. Comparison of affected and unaffected individuals generally involves 
first looking for structural alterations in the chromosomes, such as deletions or 
translocations that are visible from chromosome spreads or detectable using PCR based 
5 on that DN A sequence. Ultimately, complete sequencing of genes from several 

individuals can be performed to confirm the presence of a mutation and to distinguish 
mutations from polymorphisms. 

2. Tissue Typing 

1 0 The h VR- 1 , h VR-2, and rVR-2 sequences of the present invention can also be 

used to identify individuals from minute biological samples. The United States military, 
for example, is considering the use of restriction fragment length polymorphism (RFLP) 
for identification of its personnel. In this technique, an individual's genomic DNA is 
digested with one or more restriction enzymes, and probed on a Southern blot to yield 
1 5 unique bands for identification. This method does not suffer from the current limitations 
of "Dog Tags" which can be lost, switched, or stolen, making positive identification 
difficult. The sequences of the present invention are useful as additional DNA markers 
for RFLP (described in U.S. Patent 5,272,057). 

Furthermore, the sequences of the present invention can be used to provide an 
20 alternative technique which determines the actual base-by-base DNA sequence of 
selected portions of an individual's genome. Thus, the hVR-1, hVR-2, and rVR-2 
nucleotide sequences described herein can be used to prepare two PCR primers from the 
5' and 3' ends of the sequences. These primers can then be used to amplify an 
individual's DNA and subsequently sequence it. 
25 Panels of corresponding DNA sequences from individuals, prepared in this 

manner, can provide unique individual identifications, as each individual will have a 
unique set of such DNA sequences due to allelic differences. The sequences of the 
present invention can be used to obtain such identification sequences from individuals 
and from tissue. The hVR-1 , hVR-2, and rVR-2 nucleotide sequences of the invention 
30 uniquely represent portions of the human genome. Allelic variation occurs to some 

degree in the coding regions of these sequences, and to a greater degree in the noncoding 
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regions. It is estimated that allelic variation between individual humans occurs with a 
frequency of about once per each 500 bases. Each of the sequences described herein 
can, to some degree, be used as a standard against which DNA from an individual can be 
compared for identification purposes. Because greater numbers of polymorphisms occur 

5 in the noncoding regions, fewer sequences are necessary to differentiate individuals. 

If a panel of reagents from hVR-1, hVR-2, and rVR-2 nucleotide sequences 
described herein is used to generate a unique identification database for an individual, 
those same reagents can later be used to identify tissue from that individual. Using the 
unique identification database, positive identification of the individual, living or dead, 

10 can be made from extremely small tissue samples. 

3. Use of Partial hVR-1, hVR-2, and rVR-2 Sequences in Forensic Biology 
DNA-based identification techniques can also be used in forensic biology. 
Forensic biology is a scientific field employing genetic typing of biological evidence 
1 5 found at a crime scene as a means for positively identifying, for example, a perpetrator 
of a crime. To make such an identification, PCR technology can be used to amplify 
DNA sequences taken from very small biological samples such as tissues, e.g., hair or 
skin, or body fluids, e.g., blood, saliva, or semen found at a crime scene. The amplified 
sequence can then be compared to a standard, thereby allowing identification of the 
20 origin of the biological sample. 

The sequences of the present invention can be used to provide polynucleotide 
reagents, e.g. , PCR primers, targeted to specific loci in the human genome, which can 
enhance the reliability of DNA-based forensic identifications by, for example, providing 
another "identification marker" (i.e. another DNA sequence that is unique to a particular 
25 individual). As mentioned above, actual base sequence information can be used for 
identification as an accurate alternative to patterns formed by restriction enzyme 
generated fragments. Examples of polynucleotide reagents include the hVR-1, hVR-2, 
and rVR-2 nucleotide sequences or portions thereof, e.g., fragments derived from SEQ 
ID NO:l, 3, 4, 6, 7, 9, 10, or 1 1 having a length of at least 20 bases, preferably at least 
30 30 bases. 
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The hVR-1, hVR-2, and rVR-2 nucleotide sequences described herein can further 
be used to provide polynucleotide reagents, e.g., labeled or labelable probes which can 
be used in, for example, an in situ hybridization technique, to identify a specific tissue, 
e.g., brain tissue. This can be very useful in cases where a forensic pathologist is 
5 presented with a tissue of unknown origin. Panels of such hVR-1 , hVR-2, and rVR-2 
probes can be used to identify tissue by species and/or by organ type. 

In a similar fashion, these reagents, e.g., hVR-1, hVR-2, and rVR-2 primers or 
probes can be used to screen tissue culture for contamination (i.e. screen for the presence 
of a mixture of different types of cells in a culture). 

10 

C. Predictive Medicine : 

The present invention also pertains to the field of predictive medicine in which 
diagnostic assays, prognostic assays, and monitoring clinical trials are used for 
prognostic (predictive) purposes to thereby treat an individual prophylactically. 

15 Accordingly, one aspect of the present invention relates to diagnostic assays for 

determining hVR-1, hVR-2, and rVR-2 protein and/or nucleic acid expression as well as 
hVR-1, hVR-2, and rVR-2 activity, in the context of a biological sample (e.g., blood, 
serum, cells, tissue) to thereby determine whether an individual is afflicted with a 
disease or disorder, or is at risk of developing a disorder, associated with aberrant hVR- 

20 1 , hVR-2, and rVR-2 expression or activity. The invention also provides for prognostic 
(or predictive) assays for determining whether an individual is at risk of developing a 
disorder associated with hVR-1, hVR-2, and rVR-2 protein, nucleic acid expression or 
activity. For example, mutations in an hVR-1, hVR-2, and rVR-2 gene can be assayed 
in a biological sample. Such assays can be used for prognostic or predictive purpose to 

25 thereby phophylactically treat an individual prior to the onset of a disorder characterized 
by or associated with hVR-1, hVR-2, and rVR-2 protein, nucleic acid expression or 
activity. 

Another aspect of the invention pertains to monitoring the influence of agents 
(e.g., drugs, compounds) on the expression or activity of hVR-1, hVR-2, and rVR-2 in 
30 clinical trials. 

These and other agents are described in further detail in the following sections. 
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1. Diagnostic Assays 

An exemplary method for detecting the presence or absence of hVR-1, hVR-2, 
and rVR-2 protein or nucleic acid in a biological sample involves obtaining a biological 
sample from a test subject and contacting the biological sample with a compound or an 
5 agent capable of detecting hVR- 1 , hVR-2, and rVR-2 protein or nucleic acid (e.g. , 
mRNA, genomic DNA) that encodes hVR-1, hVR-2, and rVR-2 protein such that the 
presence of hVR-1, hVR-2, and rVR-2 protein or nucleic acid is detected in the 
biological sample. A agent for detecting hVR-1, hVR-2, and rVR-2 mRNA or genomic 
DNA is a labeled nucleic acid probe capable of hybridizing to hVR-K hVR-2, and rVR- 

10 2 mRNA or genomic DNA. The nucleic acid probe can be, for example, a full-length 
hVR-1, hVR-2, and rVR-2 nucleic acid, such as the nucleic acid of SEQ ID NO:l, 3, 4, 
6, 7, 9, 10, or 12, or a portion thereof, such as an oligonucleotide of at least 15, 30, 50, 
100, 250 or 500 nucleotides in length and sufficient to specifically hybridize under 
stringent conditions to hVR-1, hVR-2, and rVR-2 mRNA or genomic DNA. Other 

1 5 suitable probes for use in the diagnostic assays of the invention are described herein. 

An agent for detecting hVR-1, hVR-2, and rVR-2 protein is an antibody capable 
of binding to hVR-1, hVR-2, and rVR-2 protein, preferably an antibody with a 
detectable label. Antibodies can be polyclonal, or more preferably, monoclonal. An 
intact antibody, or a fragment thereof (e.g., Fab or F(ab'>2) can be used. The term 

20 "labeled", with regard to the probe or antibody, is intended to encompass direct labeling 
of the probe or antibody by coupling (i.e., physically linking) a detectable substance to 
the probe or antibody, as well as indirect labeling of the probe or antibody by reactivity 
with another reagent that is directly labeled. Examples of indirect labeling include 
detection of a primary antibody using a fluorescently labeled secondary antibody and 

25 end-labeling of a DNA probe with biotin such that it can be detected with fluorescently 
labeled streptavidin. The term "biological sample" is intended to include tissues, cells 
and biological fluids isolated from a subject, as well as tissues, cells and fluids present 
within a subject. That is, the detection method of the invention can be used to detect 
hVR-1, hVR-2, and rVR-2 mRNA, protein, or genomic DNA in a biological sample in 

30 vitro as well as in vivo. For example, in vitro techniques for detection of hVR-1, hVR-2, 
and rVR-2 mRNA include Northern hybridizations and in situ hybridizations. In vitro 
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techniques for detection of hVR-1, hVR-2, and rVR-2 protein include enzyme linked 
immunosorbent assays (ELISAs), Western blots, immunoprecipitations and 
immunofluorescence. In vitro techniques for detection of hVR- 1 , hVR-2, and rVR-2 
genomic DNA include Southern hybridizations. Furthermore, in vivo techniques for 
5 detection of hVR-1, hVR-2, and rVR-2 protein include introducing into a subject a 
labeled anti-hVR-1, hVR-2, and rVR-2 antibody. For example, the antibody can be 
labeled with a radioactive marker whose presence and location in a subject can be 
detected by standard imaging techniques. 

In one embodiment, the biological sample contains protein molecules from the 
10 test subject. Alternatively, the biological sample can contain mRNA molecules from the 
test subject or genomic DNA molecules from the test subject. A biological sample is a 
serum sample isolated by conventional means from a subject. 

In another embodiment, the methods further involve obtaining a control 
biological sample from a control subject, contacting the control sample with a 
1 5 compound or agent capable of detecting hVR- 1 , hVR-2, and rVR-2 protein, mRNA, or 
genomic DNA, such that the presence of hVR-1, hVR-2, and rVR-2 protein, mRNA or 
genomic DNA is detected in the biological sample, and comparing the presence of hVR- 

1 , hVR-2, and rVR-2 protein, mRNA or genomic DNA in the control sample with the 
presence of hVR-1, hVR-2, and rVR-2 protein, mRNA or genomic DNA in the test 

20 sample. 

The invention also encompasses kits for detecting the presence of hVR-1, hVR- 

2, and rVR-2 in a biological sample. For example, the kit can comprise a labeled 
compound or agent capable of detecting hVR-1, hVR-2, and rVR-2 protein or mRNA in 
a biological sample; means for determining the amount of hVR-1, hVR-2, and rVR-2 in 

25 the sample; and means for comparing the amount of hVR-1, hVR-2, and rVR-2 in the 
sample with a standard. The compound or agent can be packaged in a suitable container. 
The kit can further comprise instructions for using the kit to detect hVR-1, hVR-2, and 
rVR-2 protein or nucleic acid. 
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2. Prognostic Assays 

The diagnostic methods described herein can furthermore be utilized to identify 
subjects having or at risk of developing a disease or disorder associated with aberrant 
hVR-1, hVR-2, and rVR-2 expression or activity. As used herein, the term "aberrant" 
5 includes an hVR-1, hVR-2, and rVR-2 expression or activity which deviates from the 
wild type hVR-1, hVR-2, and rVR-2 expression or activity. Aberrant expression or 
activity includes increased or decreased expression or activity, as well as expression or 
activity which does not follow the wild type developmental pattern of expression or the 
subcellular pattern of expression. For example, aberrant hVR-1 , hVR-2, and rVR-2 

10 expression or activity is intended to include the cases in which a mutation in the hVR-1, 
hVR-2, and rVR-2 gene causes the hVR-1, hVR-2, and rVR-2 gene to be under- 
expressed or over-expressed and situations in which such mutations result in a non- 
functional hVR-1, hVR-2, and rVR-2 protein or a protein which does not function in a 
wild-type fashion, e.g., a protein which does not interact with an hVR-1, hVR-2, and 

15 rVR-2 ligand or one which interacts with a non-hVR-1 , non-hVR-2, and non-rVR-2 
ligand. 

The assays described herein, such as the preceding diagnostic assays or the 
following assays, can be utilized to identify a subject having or at risk of developing a 
disorder associated with a misregulation in hVR-1, hVR-2, and rVR-2 protein activity or 

20 nucleic acid expression, such as a pain disorder. Alternatively, the prognostic assays can 
be utilized to identify a subject having or at risk for developing a disorder associated 
with a misregulation in hVR-1, hVR-2, and rVR-2 protein activity or nucleic acid 
expreission, such as a pain disorder. Thus, the present invention provides a method for 
identifying a disease or disorder associated with aberrant hVR-1, hVR-2, and rVR-2 

25 expression or activity in which a test sample is obtained from a subject and hVR-1, 
hVR-2, and rVR-2 protein or nucleic acid (e.g., mRNA or genomic DNA) is detected, 
wherein the presence of hVR-1, hVR-2, and rVR-2 protein or nucleic acid is diagnostic 
for a subject having or at risk of developing a disease or disorder associated with 
aberrant hVR-1 , hVR-2, and rVR-2 expression or activity. As used herein, a "test 

30 sample" refers to a biological sample obtained from a subject of interest. For example, a 
test sample can be a biological fluid (e.g., serum), cell sample, or tissue. 
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Furthermore, the prognostic assays described herein can be used to determine 
whether a subject can be administered an agent (e.g., an agonist, antagonist, 
peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drug candidate) 
to treat a disease or disorder associated with aberrant hVR-1, hVR-2, and rVR-2 
5 expression or activity. For example, such methods can be used to determine whether a 
subject can be effectively treated with an agent for a pain disorder. Thus, the present 
invention provides methods for determining whether a subject can be effectively treated 
with an agent for a disorder associated with aberrant hVR-1, hVR-2, and rVR-2 
expression or activity in which a test sample is obtained and hVR-1, hVR-2, and rVR-2 
1 0 protein or nucleic acid expression or activity is detected (e.g., wherein the abundance of 
hVR-L hVR-2, and rVR-2 protein or nucleic acid expression or activity is diagnostic for 
a subject that can be administered the agent to treat a disorder associated with aberrant 
hVR-L hVR-2, and rVR-2 expression or activity). 

The methods of the invention can also be used to detect genetic alterations in an 
15 hVR-1, hVR-2, and rVR-2 gene, thereby determining if a subject with the altered gene is 
at risk for a disorder characterized by misregulation in hVR-1, hVR-2, and rVR-2 
protein activity or nucleic acid expression, such as a neurodegenerative disorder. In 
embodiments, the methods include detecting, in a sample of cells from the subject, the 
presence or absence of a genetic alteration characterized by at least one of an alteration 
20 affecting the integrity of a gene encoding an hVR-1 , hVR-2, and rVR-2 -protein, or the 
mis-expression of the hVR-1 , hVR-2, and rVR-2 gene. For example, such genetic 
alterations can be detected by ascertaining the existence of at least one of 1) a deletion of 
one or more nucleotides from an hVR-1, hVR-2, and rVR-2 gene; 2) an addition of one 
or more nucleotides to an hVR-1, hVR-2, and rVR-2 gene; 3) a substitution of one or 
25 more nucleotides of an hVR- 1 , hVR-2, and rVR-2 gene, 4) a chromosomal 

rearrangement of an hVR-1, hVR-2, and rVR-2 gene; 5) an alteration in the level of a 
messenger RNA transcript of an hVR-1, hVR-2, and rVR-2 gene, 6) aberrant 
modification of an hVR-1, hVR-2, and rVR-2 gene, such as of the methylation pattern of 
the genomic DNA, 7) the presence of a non-wild type splicing pattern of a messenger 
30 RNA transcript of an hVR-1, hVR-2, and rVR-2 gene, 8) a non-wild type level of an 

hVR-1, hVR-2, and rVR-2-protein, 9) allelic loss of an hVR-1, hVR-2, and rVR-2 gene, 
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and 10) inappropriate post-translational modification of an hVR-1, hVR-2, and rVR-2- 
protein. As described herein, there are a large number of assays known in the art which 
can be used for detecting alterations in an hVR-1, hVR-2, and rVR-2 gene. A biological 
sample is a tissue or serum sample isolated by conventional means from a subject. 
5 In certain embodiments, detection of the alteration involves the use of a 

probe/primer in a polymerase chain reaction (PCR) (see, e.g., U.S. Patent Nos. 
4,683,195 and 4,683,202), such as anchor PCR or RACE PCR, or, alternatively, in a 
ligation chain reaction (LCR) (see, e.g., Landegran et al (1988) Science 241 : 1077-1 080; 
and Nakazawa et al (1994) Proc. Natl. Acad. Sci. USA 91:360-364), the latter of which 

10 can be particularly useful for detecting point mutations in the hVR-1, hVR-2, and rVR- 
2-gene (see Abravaya et al (1995) Nucleic Acids Res .23:675-682). This method can 
include the steps of collecting a sample of cells from a subject, isolating nucleic acid 
{e.g., genomic, mRNA or both) from the cells of the sample, contacting the nucleic acid 
sample with one or more primers which specifically hybridize to an hVR-1, hVR-2, and 

1 5 rVR-2 gene under conditions such that hybridization and amplification of the hVR-I , 
hVR-2, and rVR-2-gene (if present) occurs, and detecting the presence or absence of an 
amplification product, or detecting the size of the amplification product and comparing 
the length to a control sample. It is anticipated that PCR and/or LCR may be desirable 
to use as a preliminary amplification step in conjunction with any of the techniques used 

20 for detecting mutations described herein. 

Alternative amplification methods include: self sustained sequence replication 
(Guatelli, J.C. et al, (1990) Proc. Natl Acad. Sci. USA 87:1874-1878), transcriptional 
amplification system (Kwoh, D.Y. et al, (1989) Proc. Natl Acad. Set USA 86:1 173- 
1 177), Q-Beta Replicase (Lizardi, P.M. et al (1988) Bio-Technology 6:1 197), or any 

25 other nucleic acid amplification method, followed by the detection of the amplified 
molecules using techniques well known to those of skill in the art. These detection 
schemes are especially useful for the detection of nucleic acid molecules if such 
molecules are present in very low numbers. 

In an alternative embodiment, mutations in an hVR-1, hVR-2, and rVR-2 gene 

30 from a sample cell can be identified by alterations in restriction enzyme cleavage 
patterns. For example, sample and control DNA is isolated, amplified (optionally), 
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digested with one or more restriction endonucleases, and fragment length sizes are 
determined by gel electrophoresis and compared. Differences in fragment length sizes 
between sample and control DNA indicates mutations in the sample DNA. Moreover, 
the use of sequence specific ribozymes (see, for example, U.S. Patent No. 5,498,531) 
5 can be used to score for the presence of specific mutations by development or loss of a 
ribozyme cleavage site. 

In other embodiments, genetic mutations in hVR-1, hVR-2, and rVR-2 can be 
identified by hybridizing a sample and control nucleic acids, e.g., DNA or RNA, to high 
density arrays containing hundreds or thousands of oligonucleotides probes (Cronin, 
10 M.T. et al. (1996) Human Mutation 7: 244-255; Kozal, M.J. et al (1 996) Nature 

Medicine 2: 753-759). For example, genetic mutations in hVR-1, hVR-2, and rVR-2 
can be identified in two dimensional arrays containing light-generated DNA probes as 
described in Cronin, M.T. et al supra. Briefly, a first hybridization array of probes can 
be used to scan through long stretches of DNA in a sample and control to identify base 
1 5 changes between the sequences by making linear arrays of sequential overlapping 

probes. This step allows the identification of point mutations. This step is followed by 
a second hybridization array that allows the characterization of specific mutations by 
using smaller, specialized probe arrays complementary to all variants or mutations 
detected. Each mutation array is composed of parallel probe sets, one complementary to 
20 the wild-type gene and the other complementary to the mutant gene. 

In yet another embodiment, any of a variety of sequencing reactions known in 
the art can be used to directly sequence the hVR-1, hVR-2, and rVR-2 gene and detect 
mutations by comparing the sequence of the sample hVR-1, hVR-2, and rVR-2 with the 
corresponding wild-type (control) sequence. Examples of sequencing reactions include 
25 those based on techniques developed by Maxam and Gilbert ((1977) Proc. Natl Acad. 
Scl USA 74:560) or Sanger ((1977) Proc. Natl Acad. ScL USA 74:5463). It is also 
contemplated that any of a variety of automated sequencing procedures can be utilized 
when performing the diagnostic assays ((1995) Biotechniques 19:448), including 
sequencing by mass spectrometry (see, e.g., PCT International Publication No. WO 
30 94/16101; Cohen et al (1996) Adv. Chromatogr. 36:127-162; and Griffin et al (1993) 
Appl. Biochem. Biotechnol 38:147-159). 
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Other methods for detecting mutations in the hVR-1 , hVR-2, and rVR-2 gene 
include methods in which protection from cleavage agents is used to detect mismatched 
bases in RNA/RNA or RNA/DNA heteroduplexes (Myers et al (1985) Science 
230:1242). In general, the art technique of "mismatch cleavage" starts by providing 
5 heteroduplexes of formed by hybridizing (labeled) RNA or DNA containing the wild- 
type hVR-1, hVR-2, and rVR-2 sequence with potentially mutant RNA or DNA 
obtained from a tissue sample. The double-stranded duplexes are treated with an agent 
which cleaves single-stranded regions of the duplex such as which will exist due to 
basepair mismatches between the control and sample strands. For instance, RNA/DNA 

1 0 duplexes can be treated with RNase and DNA/DNA hybrids treated with S 1 nuclease to 
enzymatically digesting the mismatched regions. In other embodiments, either 
DNA/DNA or RNA/DNA duplexes can be treated with hydroxylamine or osmium 
tetroxide and with piperidine in order to digest mismatched regions. After digestion of 
the mismatched regions, the resulting material is then separated by size on denaturing 

1 5 polyacrylamide gels to determine the site of mutation. See, for example, Cotton et al 
(1988) Proc. Natl Acad Sci USA 85:4397; Saleeba et al (1992) Methods Enzymol 
217:286-295. In a embodiment, the control DNA or RNA can be labeled for detection. 

In still another embodiment, the mismatch cleavage reaction employs one or 
more proteins that recognize mismatched base pairs in double-stranded DNA (so called 

20 "DNA mismatch repair" enzymes) in defined systems for detecting and mapping point 
mutations in hVR-1, hVR-2, and rVR-2 cDNAs obtained from samples of cells. For 
example, the mutY enzyme of E. coli cleaves A at G/A mismatches and the thymidine 
DNA glycosylase from HeLa cells cleaves T at G/T mismatches (Hsu et al (1994) 
Carcinogenesis 15:1657-1662). According to an exemplary embodiment, a probe based 

25 on an hVR- 1 , hVR-2, and rVR-2 sequence, e.g. , a wild-type hVR- 1 , hVR-2, and rVR-2 
sequence, is hybridized to a cDNA or other DNA product from a test cell(s). The duplex 
is treated with a DNA mismatch repair enzyme, and the cleavage products, if any, can be 
detected from electrophoresis protocols or the like. See, for example, U.S. Patent No. 
5,459,039. 
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In other embodiments, alterations in electrophoretic mobility will be used to 
identify mutations in hVR-1, hVR-2, and rVR-2 genes. For example, single strand 
conformation polymorphism (SSCP) may be used to detect differences in electrophoretic 
mobility between mutant and wild type nucleic acids (orita et al (1989) Proc Natl 
5 Acad. Sci USA: 86:2766, see also Cotton (1993) Mutat. Res. 285: 125-144; and Hayashi 
(1992) Genet. Anal Tech. Appl 9:73-79). Single-stranded DNA fragments of sample 
and control hVR-1, hVR-2, and rVR-2 nucleic acids will be denatured and allowed to 
renature. The secondary structure of single-stranded nucleic acids varies according to 
sequence, the resulting alteration in electrophoretic mobility enables the detection of 
10 even a single base change. The DNA fragments may be labeled or detected with labeled 
probes. The sensitivity of the assay may be enhanced by using RNA (rather than DNA), 
in which the secondary structure is more sensitive to a change in sequence. In a 
embodiment, the subject method utilizes heteroduplex analysis to separate double 
stranded heteroduplex molecules on the basis of changes in electrophoretic mobility 
1 5 (Keen et (1991) Trends Genet 7:5). 

In yet another embodiment the movement of mutant or wild-type fragments in 
polyacrylamide gels containing a gradient of denaturant is assayed using denaturing 
gradient gel electrophoresis (DGGE) (Myers et al. (1 985) Nature 3 1 3:495). When 
DGGE is used as the method of analysis, DNA will be modified to insure that it does not 
20 completely denature, for example by adding a GC clamp of approximately 40 bp of 

high-melting GC-rich DNA by PCR. In a further embodiment, a temperature gradient is 
used in place of a denaturing gradient to identify differences in the mobility of control 
and sample DNA (Rosenbaum and Reissner (1987) Biophys Chem 265:12753). 

Examples of other techniques for detecting point mutations include, but are not 
25 limited to, selective oligonucleotide hybridization, selective amplification, or selective 
primer extension. For example, oligonucleotide primers may be prepared in which the 
known mutation is placed centrally and then hybridized to target DNA under conditions 
which permit hybridization only if a perfect match is found (Saiki et al (1986) Nature 
324:163); Saiki et al. (1989) Proc. Natl Acad Sci USA 86:6230). Such allele specific 
30 oligonucleotides are hybridized to PCR amplified target DNA or a number of different 
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mutations when the oligonucleotides are attached to the hybridizing membrane and 
hybridized with labeled target DNA. 

Alternatively, allele specific amplification technology which depends on selective 
PCR amplification may be used in conjunction with the instant invention. 
5 Oligonucleotides used as primers for specific amplification may carry the mutation of 
interest in the center of the molecule (so that amplification depends on differential 
hybridization) (Gibbs et al (1989) Nucleic Acids Res, 17:2437-2448) or at the extreme 3* 
end of one primer where, under appropriate conditions, mismatch can prevent, or reduce 
polymerase extension (Prossner (1993) Tibtech 1 1 :238). In addition it may be desirable 

1 0 to introduce a novel restriction site in the region of the mutation to create cleavage-based 
detection (Gasparini et ai (1992) Mol. Cell Probes 6:1). It is anticipated that in certain 
embodiments amplification may also be performed using Taq ligase for amplification 
(Barany (1991) Proc. Natl. Acad. Set USA 88:189). In such cases, ligation will occur 
only if there is a perfect match at the 3* end of the 5' sequence making it possible to detect 

1 5 the presence of a known mutation at a specific site by looking for the presence or absence 
of amplification. 

The methods described herein may be performed, for example, by utilizing pre- 
packaged diagnostic kits comprising at least one probe nucleic acid or antibody reagent 
described herein, which may be conveniently used, e.g., in clinical settings to diagnose 
20 patients exhibiting symptoms or family history of a disease or illness involving an hVR- 
1, hVR-2, and rVR-2 gene. 

Furthermore, any cell type or tissue in which hVR-1, hVR-2, and rVR-2 is 
expressed may be utilized in the prognostic assays described herein. 

25 3. Monitoring of Effects During Clinical Trials 

Monitoring the influence of agents (e.g., drugs) on the expression or activity of 
an hVR-1, hVR-2, and rVR-2 protein can be applied not only in basic drug screening, 
but also in clinical trials. For example, the effectiveness of an agent determined by a 
screening assay as described herein to increase hVR-1, hVR-2, and rVR-2 gene 

30 expression, protein levels, or upregulate hVR-1, hVR-2, and rVR-2 activity, can be 
monitored in clinical trials of subjects exhibiting decreased hVR-1, hVR-2, and rVR-2 
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gene expression, protein levels, or downregulated hVR-1, hVR-2, and rVR-2 activity. 
Alternatively, the effectiveness of an agent determined by a screening assay to decrease 
hVR-1, hVR-2, and rVR-2 gene expression, protein levels, or downregulate hVR-1, 
hVR-2, and rVR-2 activity, can be monitored in clinical trials of subjects exhibiting 
5 increased hVR-1 , hVR-2, and rVR-2 gene expression, protein levels, or upregulated 
hVR-1, hVR-2, and rVR-2 activity. In such clinical trials, the expression or activity of 
an hVR-1, hVR-2, and rVR-2 gene, and preferably, other genes that have been 
implicated in, for example, an hVR-1, hVR-2, and rVR-2-associated disorder can be 
used as a "read out" or markers of the phenotype of a particular cell. 
10 For example, and not by way of limitation, genes, including hVR-1, hVR-2, and 

rVR-2, that are modulated in cells by treatment with an agent (e.g., compound, drug or 
small molecule) which modulates hVR-1, hVR-2, and rVR-2 activity (e.g., identified in 
a screening assay as described herein) can be identified. Thus, to study the effect of 
agents on hVR-1, hVR-2, and rVR-2-associated disorders (e.g., pain disorders), for 
15 example, in a clinical trial, cells can be isolated and RNA prepared and analyzed for the 
levels of expression of hVR-1, hVR-2, and rVR-2 and other genes implicated in the 
hVR-1, hVR-2, and rVR-2-associated disorder, respectively. The levels of gene 
expression (e.g., a gene expression pattern) can be quantified by northern blot analysis 
or RT-PCR, as described herein, or alternatively by measuring the amount of protein 

20 produced, by one of the methods as described herein, or by measuring the levels of 
activity of hVR-1 , hVR-2, and rVR-2 or other genes. In this way, the gene expression 
pattern can serve as a marker, indicative of the physiological response of the cells to the 
agent. Accordingly, this response state may be determined before, and at various points 
during treatment of the individual with the agent. 

25 In a embodiment, the present invention provides a method for monitoring the 

effectiveness of treatment of a subject with an agent (e.g., an agonist, antagonist, 
peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drug candidate 
identified by the screening assays described herein) including the steps of (i) obtaining a 
pre-administration sample from a subject prior to administration of the agent; (ii) 

30 detecting the level of expression of an hVR-1 , hVR-2, and rVR-2 protein, mRNA, or 
genomic DNA in the preadministration sample; (iii) obtaining one or more post- 
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administration samples from the subject; (iv) detecting the level of expression or activity 
of the hVR-1, hVR-2, and rVR-2 protein, mRNA, or genomic DNA in the post- 
administration samples; (v) comparing the level of expression or activity of the hVR-1, 
hVR-2, and rVR-2 protein, mRNA, or genomic DNA in the pre-administration sample 
5 with the hVR-1, hVR-2, and rVR-2 protein, mRNA, or genomic DNA in the post 

administration sample or samples; and (vi) altering the administration of the agent to the 
subject accordingly. For example, increased administration of the agent may be 
desirable to increase the expression or activity of hVR-1, hVR-2, and rVR-2 to higher 
levels than detected, Le., to increase the effectiveness of the agent. Alternatively, 
1 0 decreased administration of the agent may be desirable to decrease expression or activity 
of hVR-1, hVR-2, and rVR-2 to lower levels than detected, Le. to decrease the 
effectiveness of the agent. According to such an embodiment, hVR-1, hVR-2, and rVR- 
2 expression or activity may be used as an indicator of the effectiveness of an agent, 
even in the absence of an observable phenotypic response. 

15 

D. Methods of Treatment : 

The present invention provides for both prophylactic and therapeutic methods of 
treating a subject at risk of (or susceptible to) a disorder or having a disorder associated 
with aberrant hVR-1, hVR-2, and rVR-2 expression or activity. With regards to both 

20 prophylactic and therapeutic methods of treatment, such treatments may be specifically 
tailored or modified, based on knowledge obtained from the field of pharmacogenomics. 
"Pharmacogenomics", as used herein, refers to the application of genomics technologies 
such as gene sequencing, statistical genetics, and gene expression analysis to drugs in 
clinical development and on the market. More specifically, the term refers the study of 

25 how a patient's genes determine his or her response to a drug (e.g., a patient's "drug 
response phenotype", or "drug response genotype".) Thus, another aspect of the 
invention provides methods for tailoring an individual's prophylactic or therapeutic 
treatment with either the hVR-1, hVR-2, and rVR-2 molecules of the present invention 
or hVR-1, hVR-2, and rVR-2 modulators according to that individual's drug response 

30 genotype. Pharmacogenomics allows a clinician or physician to target prophylactic or 
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therapeutic treatments to patients who will most benefit from the treatment and to avoid 
treatment of patients who will experience toxic drug-related side effects. 

1. Prophylactic Methods 
5 In one aspect, the invention provides a method for preventing in a subject, a 

disease or condition associated with an aberrant hVR-1, hVR-2, and rVR-2 expression 
or activity, by administering to the subject an hVR-1, hVR-2, and rVR-2 or an agent 
which modulates hVR-1, hVR-2, and rVR-2 expression or at least one hVR-1, hVR-2, 
and rVR-2 activity. Subjects at risk for a disease which is caused or contributed to by 

10 aberrant hVR-1, hVR-2, and rVR-2 expression or activity can be identified by, for 
example, any or a combination of diagnostic or prognostic assays as described herein. 
Administration of a prophylactic agent can occur prior to the manifestation of symptoms 
characteristic of the hVR-1, hVR-2, and rVR-2 aberrancy, such that a disease or disorder 
is prevented or, alternatively, delayed in its progression. Depending on the type of hVR- 

15 1 , hVR-2, and rVR-2 aberrancy, for example, an hVR-1 9 hVR-2, and rVR-2, hVR-1 , 
hVR-2, and rVR-2 agonist or hVR-1, hVR-2, and rVR-2 antagonist agent can be used 
for treating the subject. The appropriate agent can be determined based on screening 
assays described herein. 

20 2. Therapeutic Methods 

Another aspect of the invention pertains to methods of modulating hVR-1, hVR- 
2, and rVR-2 expression or activity for therapeutic purposes. Accordingly, in an 
exemplary embodiment, the modulatory method of the invention involves contacting a 
cell with an hVR-1, hVR-2, and rVR-2 or agent that modulates one or more of the 

25 activities of hVR-1 , hVR-2, and rVR-2 protein activity associated with the cell. An 
agent that modulates hVR-1, hVR-2, and rVR-2 protein activity can be an agent as 
described herein, such as a nucleic acid or a protein, a naturally-occurring target 
molecule of an hVR-1, hVR-2, and rVR-2 protein (e.g., an hVR-1, hVR-2, and rVR-2 
substrate), an hVR-1, hVR-2, and rVR~2 antibody, an hVR-1, hVR-2, and rVR-2 agonist 

30 or antagonist, a peptidomimetic of an hVR-1 , hVR-2, and rVR-2 agonist or antagonist, 
or other small molecule. In one embodiment, the agent stimulates one or more hVR-1 , 
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1, hVR-2, and rVR-2 protein and a nucleic acid molecule encoding hVR-1, hVR-2, and 
rVR-2 that has been introduced into the cell. In another embodiment, the agent inhibits 
one or more hVR-1, hVR-2, and rVR-2 activities. Examples of such inhibitory agents 

5 include antisense hVR-1, hVR-2, and rVR-2 nucleic acid molecules, anti-hVR-1, hVR- 

2, and rVR-2 antibodies, and hVR-1 , hVR-2, and rVR-2 inhibitors. These modulatory 
methods can be performed in vitro (e.g., by culturing the cell with the agent) or, 
alternatively, in vivo (e.g., by administering the agent to a subject). As such, the present 
invention provides methods of treating an individual afflicted with a disease or disorder 

10 characterized by aberrant expression or activity of an hVR-1, hVR-2, and rVR-2 protein 
or nucleic acid molecule. In one embodiment, the method involves administering an 
agent (e.g., an agent identified by a screening assay described herein), or combination of 
agents that modulates (e.g., upregulates or downregulates) hVR-1, hVR-2, and rVR-2 
expression or activity. In another embodiment, the method involves administering an 

15 hVR-1, hVR-2, and rVR-2 protein or nucleic acid molecule as therapy to compensate for 
reduced or aberrant hVR-1, hVR-2, and rVR-2 expression or activity. 

Stimulation of hVR-1, hVR-2, and rVR-2 activity is desirable in situations in 
which hVR-1, hVR-2, and rVR-2 is abnormally downregulated and/or in which 
increased hVR-1, hVR-2, and rVR-2 activity is likely to have a beneficial effect. For 

20 example, stimulation of hVR-1, hVR-2, and rVR-2 activity is desirable in situations in 
which an hVR-1, hVR-2, and rVR-2 is downregulated and/or in which increased hVR-1, 
hVR-2, and rVR-2 activity is likely to have a beneficial effect. Likewise, inhibition of 
hVR-1, hVR-2, and rVR-2 activity is desirable in situations in which hVR-1, hVR-2, 
and rVR-2 is abnormally upregulated and/or in which decreased hVR- 1 , hVR-2, and 

25 rVR-2 activity is likely to have a beneficial effect. 



3. Pharmacogenomics 

The hVR-1, hVR-2, and rVR-2 molecules of the present invention, as well as 
agents, or modulators which have a stimulatory or inhibitory effect on hVR-1, hVR-2, 
30 and rVR-2 activity (e.g., hVR-1, hVR-2, and rVR-2 gene expression) as identified by a 
screening assay described herein can be administered to individuals to treat 
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(prophylactically or therapeutically) hVR-1, hVR-2, and rVR-2-associated disorders 
(e.g., pain disorders) associated with aberrant hVR-1, hVR-2, and rVR-2 activity. In 
conjunction with such treatment, pharmacogenomics (i.e., the study of the relationship 
between an individual's genotype and that individual's response to a foreign compound 
5 or drug) may be considered. Differences in metabolism of therapeutics can lead to 
severe toxicity or therapeutic failure by altering the relation between dose and blood 
concentration of the pharmacologically active drug. Thus, a physician or clinician may 
consider applying knowledge obtained in relevant pharmacogenomics studies in 
determining whether to administer an hVR-1, hVR-2, and rVR-2 molecule or hVR-1, 
10 hVR-2, and rVR-2 modulator as well as tailoring the dosage and/or therapeutic regimen 
of treatment with an hVR-1, hVR-2, and rVR-2 molecule or hVR-1, hVR-2, and rVR-2 
modulator. 

Pharmacogenomics deals with clinically significant hereditary variations in the 
response to drugs due to altered drug disposition and abnormal action in affected 

15 persons. See, for example, Eichelbaum, M et al. (1996) Clin. Exp. Pharmacol. Physiol. 
23(10-1 1) :983-985 and Under, M.W. et al (1997) Clin. Chem. 43(2):254-266. In 
general, two types of pharmacogenetic conditions can be differentiated. Genetic 
conditions transmitted as a single factor altering the way drugs act on the body (altered 
drug action) or genetic conditions transmitted as single factors altering the way the body 

20 acts on drugs (altered drug metabolism). These pharmacogenetic conditions can occur 
either as rare genetic defects or as naturally-occurring polymorphisms. For example, 
glucose-6-phosphate dehydrogenase deficiency (G6PD) is a common inherited 
enzymopathy in which the main clinical complication is haemolysis after ingestion of 
oxidant drugs (anti-malarials, sulfonamides, analgesics, nitrofiirans) and consumption of 

25 fava beans. 

One pharmacogenomics approach to identifying genes that predict drug 
response, known as "a genome-wide association", relies primarily on a high-resolution 
map of the human genome consisting of already known gene-related markers (e.g., a "bi- 
allelic" gene marker map which consists of 60,000-100,000 polymorphic or variable 
30 sites on the human genome, each of which has two variants.) Such a high-resolution 
genetic map can be compared to a map of the genome of each of a statistically 
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significant number of patients taking part in a Phase II/III drug trial to identify markers 
associated with a particular observed drug response or side effect. Alternatively, such a 
high resolution map can be generated from a combination of some ten-million known 
single nucleotide polymorphisms (SNPs) in the human genome. As used herein, a 
5 "SNP" is a common alteration that occurs in a single nucleotide base in a stretch of 
DNA. For example, a SNP may occur once per every 1000 bases of DNA. A SNP may 
be involved in a disease process, however, the vast majority may not be disease- 
associated. Given a genetic map based on the occurrence of such SNPs, individuals can 
be grouped into genetic categories depending on a particular pattern of SNPs in their 
10 individual genome. In such a manner, treatment regimens can be tailored to groups of 
genetically similar individuals, taking into account traits that may be common among 
such genetically similar individuals. 

Alternatively, a method termed the "candidate gene approach", can be utilized to 
identify genes that predict drug response. According to this method, if a gene that 

15 encodes a drugs target is known (e.g., an hVR-1, hVR-2, and rVR-2 protein of the 

present invention), all common variants of that gene can be fairly easily identified in the 
population and it can be determined if having one version of the gene versus another is 
associated with a particular drug response. 

As an illustrative embodiment, the activity of drug metabolizing enzymes is a 

20 major determinant of both the intensity and duration of drug action. The discovery of 
genetic polymorphisms of drug metabolizing enzymes (e.g., N-acetyltransferase 2 (NAT 
2) and cytochrome P450 enzymes CYP2D6 and CYP2C19) has provided an explanation 
as to why some patients do not obtain the expected drug effects or show exaggerated 
drug response and serious toxicity after taking the standard and safe dose of a drug. 

25 These polymorphisms are expressed in two phenotypes in the population, the extensive 
metabolizer (EM) and poor metabolizer (PM). The prevalence of PM is different among 
different populations. For example, the gene coding for CYP2D6 is highly polymorphic 
and several mutations have been identified in PM, which all lead to the absence of 
functional CYP2D6. Poor metabolizers of CYP2D6 and CYP2C19 quite frequently 

30 experience exaggerated drug response and side effects when they receive standard doses. 
If a metabolite is the active therapeutic moiety, PM show no therapeutic response, as 
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demonstrated for the analgesic effect of codeine mediated by its C YP2D6-formed 
metabolite morphine. The other extreme are the so called ultra-rapid metabolizers who 
do not respond to standard doses. Recently, the molecular basis of ultra-rapid 
metabolism has been identified to be due to CYP2D6 gene amplification. 
5 Alternatively, a method termed the "gene expression profiling", can be utilized to 

identify genes that predict drug response. For example, the gene expression of an 
animal dosed with a drug (e.g., an hVR-1, hVR-2, and rVR-2 molecule or hVR-1, hVR- 
2, and rVR-2 modulator of the present invention) can give an indication whether gene 
pathways related to toxicity have been turned on. 

1 0 Information generated from more than one of the above pharmacogenomics 

approaches can be used to determine appropriate dosage and treatment regimens for 
prophylactic or therapeutic treatment an individual. This knowledge, when applied to 
dosing or drug selection, can avoid adverse reactions or therapeutic failure and thus 
enhance therapeutic or prophylactic efficiency when treating a subject with an hVR-1, 

15 hVR-2, and rVR-2 molecule or hVR-1, hVR-2, and rVR-2 modulator, such as a 
modulator identified by one of the exemplary screening assays described herein. 

This invention is further illustrated by the following examples which should not 
be construed as limiting. The contents of all references, patents and published patent 
20 applications cited throughout this application, as well as the Figures and the Sequence 
Listing are incorporated herein by reference. 



EXAMPLES 



25 EXAMPLE 1: IDENTIFICATION AND CHARACTERIZATION 

OF hVR-1, hVR-2, and rVR-2 cDNA 

In this example, the identification and characterization of the genes encoding 
hVR-1 (clone Fchrb87a6), hVR-2 (clone flh21el 1), hVR-2 alternate form (clone 
frhobl2c4), and rVR-2 (clone flrxbl47gl 1) are described. 

30 
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Isolation of the hVR-K hVR-2, and the rVR-2 cDNA 

The invention is based, at least in part, on the discovery of two human genes and 
one rat gene encoding novel members of the Capsaicin/Vanilloid receptor family, 
referred to herein as hVR-1 , hVR-2, and rVR-2, respectively. These clones were 
5 identified from a human heart library and a rat dorsal root ganglion (DRG) library, based 
on sequence homology to the known rat VR-1 (Accession Number AF029310). The 
sequence of the two human clones and the rat clone was determined and found to 
contain open reading frames. 

The nucleotide sequence of the full length hVR-1 cDNA and the predicted amino 
10 acid sequence of the hVR-1 polypeptide are shown in Figure 1 and in SEQ ID NOs: 1 
and 2, respectively. 

The nucleotide sequence of the full length hVR-2 cDNA and the predicted amino 
acid sequence of the hVR-2 polypeptide are shown in Figure 2 and in SEQ ID NOs:4 
and 5, respectively. 

15 The nucleotide sequence of the partial hVR-2 (alternate form) cDNA and the 

predicted amino acid sequence of the hVR-2 (alternate form) polypeptide are shown in 

Figure 3 and in SEQ ID NOs:7 and 8, respectively. 

The amino acid sequence of the predicted full length human VR-2 protein 

(alternate form) is shown in Figure 16 and in SEQ ID NO:20. 
20 The nucleotide sequence of the partial rVR-2 cDNA and the predicted amino 

acid sequence of the rVR-2 polypeptide are shown in Figure 4 and in SEQ ID NOs: 10 

and 11, respectively. 

Analysis of the hVR-1, hVR-2, and rVR-2 Molecules 

25 The hVR-1 protein (SEQ ID NO:2) was aligned with the human VR-2 protein 

(SEQ ID NO:5) using the GAP program in the GCG software package (Blosum 62 
matrix) and a gap weight of 12 and a length weight of 4. The results showed a 46.348% 
identity and 55.378% similarity between the two sequences (see Figure 5). 

The hVR-1 nucleotide sequence (SEQ ID NO:l) was aligned with the human 

30 VR-2 nucleotide sequence (SEQ ID NO:4) using the GAP program in the GCG software 
package (nwsgapdna matrix) and a gap weight of 50 and a length weight of 3. The 
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results showed a 55.316% identity and 55.316% similarity between the two sequences 
(see Figure 6). 

The hVR-2 protein (SEQ ID NO:5) was aligned with the rat VR-2 protein (SEQ 
ID NO:l 1) using the CLUSTAL W (1.74) multiple sequence alignment program (Figure 
5 7), as well as using the GAP program in the GCG software package (Blosum 62 matrix) 
and a gap weight of 12 and a length weight of 4. The results showed a 79.167% identity 
and 81.703% similarity between the two sequences (see Figure 8). 

The hVR-1 nucleotide sequence (SEQ ID NO: 1) was aligned with the rat VR-1 
nucleotide sequence (Accession Number: AF0293 1 0) using the GAP program in the 
10 GCG software package (nwsgapdna matrix) and a gap weight of 50 and a length weight 
of 3. The results showed a 82.125% identity and 82.125% similarity between the two 
sequences (see Figure 9). 

The hVR-1 protein (SEQ ID NO:2) was aligned with the rat VR-1 protein 
(Accession Number: AF0293 10) using the GAP program in the GCG software package 
15 (Blosum 62 matrix) and a gap weight of 12 and a length weight of 4. The results showed 
a 86.022% identity and 89.247% similarity between the two sequences (see Figure 10). 

The hVR-2 protein (SEQ ID NO:5) was aligned with the human VR-2 protein 
(alternate form) (SEQ ID NO:8) using the CLUSTAL W (1.74) multiple sequence 
alignment program (Figure 11). 
20 Finally, the hVR-2 protein (SEQ ID NO:5) was aligned with the predicted full 

length human VR-2 protein (alternate form) (SEQ ID NO:20) using the CLUSTAL W 
(1.74) multiple sequence alignment program (Figure 17). 

A search was performed against the HMM database resulting in the identification 
of three ankyrin repeat domains in the amino acid sequence of hVR-1 (SEQ ID NO:2) at 
25 about residues 201-233, 248-283, and 333-361, and in the amino acid sequence of hVR- 
2 (SEQ ID NO:5) at about residues 162-194, 208-243, and 293-328. The results of the 
searches are set forth in Figures 13 and 15, respectively. 

Hydropathy plots have identified 6 transmembrane domains in the hVR-1 and 
the hVR-2 proteins (see Figures 12 and 14, respectively). 
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A series of searches have revealed that the hVR-1 protein matches the ProDom 
entry 141801 for the vanilloid receptor subtype and the ProDom entry 145518 for the 
vanilloid receptor subtype. 

Moreover, a search was performed against the Prosite database resulting in the 
5 identification of four N-glycosylation sites in the amino acid sequence of SEQ ID NO:5 
(at about residues 171-174, 192-195, 604-607, and 749-752), three cGMP-dependent 
protein kinase phosphorylation sites in the amino acid sequence of SEQ ID NO:5 (at 
about residues 2-5, 368-371, and 499-502), a series of protein kinase C and Casein 
kinase II phosphorylation sites in the amino acid sequence of SEQ ID NO:5, two 
10 tyrosine kinase phosphorylation sites in the amino acid sequence of SEQ ID NO:5 (at 
about residues 368-375 and 622-628), and two myristoylation sites in the amino acid 
sequence of SEQ ID NO:5 (at about residues 169-174 and 765-770). 

Tissue Distribution of hVR-1 and hVR-2 mRNA 

15 This Example describes the tissue distribution of hVR-1 and hVR-2 mRNA as 

determined by in situ hybridization. 

For in situ analysis, tissues, such as brain regions and whole brain, obtained from 
human and monkey were first frozen on dry ice. Ten-micrometer-thick coronal sections 
of the tissues were postfixed with 4% formaldehyde in DEPC treated IX phosphate- 

20 buffered saline at room temperature for 1 0 minutes before being rinsed twice in DEPC 
IX phosphate-buffered saline and once in 0.1 M triethanolamine-HCl (pH 8.0). 
Following incubation in 0.25% acetic anhydride-0.1 M triethanolamine-HCl for 10 
minutes, sections were rinsed in DEPC 2X SSC (IX SSC is 0.1 5M NaCI plus 0.0 15M 
sodium citrate). Tissue was then dehydrated through a series of ethanol washes, 

25 incubated in 100% chloroform for 5 minutes, and then rinsed in 100% ethanol for 1 
minute and 95% ethanol for 1 minute and allowed to air dry. 

Hybridizations were performed with 35 S-radiolabeled (5 X 10 7 cpm/ml) cRNA 
probes. Probes were incubated in the presence of a solution containing 600 mM NaCI, 
10 mM Tris (pH 7.5), 1 mM EDTA, 0.01% sheared salmon sperm DNA, 0.01% yeast 

30 tRNA, 0.05% yeast total RNA type XI, 1 X Denhardt's solution, 50% formamide, 10% 
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dextran sulfate, 100 mM dithiothreitol, 0.1% sodium dodecyl sulfate (SDS), and 0.1% 
sodium thiosulfate for 18 hours at 55°C. 

After hybridization, slides were washed with 2 X SSC. Sections were then 
sequentially incubated at 37°C in TNE (a solution containing 10 mM Tris-HCI (pH 7.6), 
5 500 mM NaCI, and 1 mM EDTA), for 10 minutes, in TNE with 10|ag of RNase A per ml 
for 30 minutes, and finally in TNE for 10 minutes. Slides were then rinsed with 2 X 
SSC at room temperature, washed with 2 X SSC at 50°C for 1 hour, washed with 0.2 X 
SSC at 55°C for 1 hour, and 0.2 X SSC at 60°C for 1 hour. Sections were then 
dehydrated rapidly through serial ethanol-0.3 M sodium acetate concentrations before 
10 being air dried and exposed to Kodak Biomax MR scientific imaging film for 24 hours 
and subsequently dipped in NB-2 photoemulsion and exposed at 4°C for 7 days before 
being developed and counter stained. 

The data indicate that the hVR-1 molecule is not expressed in human nor 
monkey brain. The hVR-1 molecule is expressed in nodose, trigeminal sensory neurons, 
1 5 but is not expressed in sympathetic neurons. Within the nodose sensory neurons and 
trigeminal sensory neurons, expression was seen in distinct sub-populations. Moreover, 
hVRl is expressed in some, but not all, small dorsal root ganglion (DRG) neurons and in 
a few medium sized DRG neurons. The hVR-1 molecule is partially co-expressed with 
the neuropeptide CGRP and with substance P which are present in nociceptive neurons. 
20 The data further indicate that the VR-2 molecule is expressed in both human and 

monkey brain, primarily in cortical neurons. The VR2 molecule is also expressed in 
other brain regions, for example, the thalamus, striatum, hippocampus, hypothalamus, 
midbrain, medula and brain stem. In addition, the VR-2 molecule is expressed in 
parasympathetic neurons of the monkey heart (atrium), nodose sensory neurons, 
25 trigeminal (TRG) sensory neurons, dorsal root ganglion sensory neurons, sympathetic 
neurons, and motor neurons of the spinal cord. The VR2 molecule is widely expressed 
in TRG and DRG neurons, being present in most small and medium sized neurons and 
also in a few of the large neurons. VR2, like VR-1, partially co-localizes with CGRP 
and substance P. 
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Trigeminal sensory neurons are recognized pain centers while sympathetic 
neurons have been shown to be involved in neuropathic pain. 



EXAMPLE 2: EXPRESSION OF RECOMBINANT hVR-1, h VR-2, AND 

5 rVR-2 PROTEIN IN BACTERIAL CELLS 

In this example, hVR-1, hVR-2, and rVR-2 is expressed as a recombinant 
glutathione-S -transferase (GST) fusion polypeptide in E. coli and the fusion polypeptide 
is isolated and characterized. Specifically, hVR-1 , hVR-2, and rVR-2 is fused to GST 
and this fusion polypeptide is expressed in E. coli, e.g., strain PEB199. Expression of 
1 0 the GST-hVR- 1 , GST-hVR-2, and GST-rVR-2 fusion protein in PEB 1 99 is induced 

with IPTG. The recombinant fusion polypeptide is purified from crude bacterial lysates 
of the induced PEB 199 strain by affinity chromatography on glutathione beads. Using 
polyacrylamide gel electrophoretic analysis of the polypeptide purified from the 
bacterial lysates, the molecular weight of the resultant fusion polypeptide is determined. 

15 

EXAMPLE 3: EXPRESSION OF RECOMBINANT hVR-1, h VR-2, 

AND rVR-2 PROTEIN IN COS CELLS 

To express the hVR-1, hVR-2, and rVR-2 gene in COS cells, the pcDN A/Amp 
vector from Invitrogen Corporation (San Diego, CA) is used. This vector contains an 

20 SV40 origin of replication, an ampicillin resistance gene, an E. coli replication origin, a 
CMV promoter followed by a polylinker region, and an SV40 intron and 
polyadenylation site. A DNA fragment encoding the entire hVR-1, hVR-2, and rVR-2 
protein and an HA tag (Wilson et al (1984) Cell 37.167) or a FLAG tag fused in-frame 
to its y end of the fragment is cloned into the polylinker region of the vector, thereby 

25 placing the expression of the recombinant protein under the control of the CMV 
promoter. 

To construct the plasmid, the hVR-1, hVR-2, and rVR-2 DNA sequence is 
amplified by PCR using two primers. The 5' primer contains the restriction site of 
interest followed by approximately twenty nucleotides of the hVR-1, hVR-2, and rVR-2 
30 coding sequence starting from the initiation codon; the 3* end sequence contains 
complementary sequences to the other restriction site of interest, a translation stop 
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codon, the HA tag or FLAG tag and the last 20 nucleotides of the hVR-1, hVR-2, and 
rVR-2 coding sequence. The PCR amplified fragment and the pCDNA/Amp vector are 
digested with the appropriate restriction enzymes and the vector is dephosphorylated 
using the CIAP enzyme (New England Biolabs, Beverly, MA). Preferably the two 
5 restriction sites chosen are different so that the hVR-1, hVR-2, and rVR-2 gene is 

inserted in the correct orientation. The ligation mixture is transformed into E. coli cells 
(strains HB101, DH5a, SURE, available from Stratagene Cloning Systems, La Jolla, 
CA, can be used), the transformed culture is plated on ampicillin media plates, and 
resistant colonies are selected. Plasmid DNA is isolated from transformants and 
10 examined by restriction analysis for the presence of the correct fragment. 

COS cells are subsequently transfected with the hVR-1, hVR-2, and rVR-2- 
pcDNA/Amp plasmid DNA using the calcium phosphate or calcium chloride co- 
precipitation methods, DEAE-dextran-mediated transfection, lipofection, or 
electroporation. Other suitable methods for transfecting host cells can be found in 

15 Sambrook, J., Fritsh, E. F., and Maniatis, T. Molecular Cloning: A Laboratory Manual 
2nd, ed, Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, NY, 1989. The expression of the hVR-1, hVR-2, and rVR-2 polypeptide 
is detected by radiolabelling ( 35 S-methionine or 35 S-cysteine available from NEN, 
Boston, MA, can be used) and immunoprecipitation (Harlow, E. and Lane, D. 

20 Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, NY, 1988) using an HA specific monoclonal antibody. Briefly, the cells are 
labelled for 8 hours with 35 S-methionine (or 35 S-cysteine). The culture media are then 
collected and the cells are lysed using detergents (RIPA buffer, 150 mM NaCl, 1% NP- 
40, 0.1% SDS, 0.5% DOC, 50 mM Tris, pH 7.5). Both the cell lysate and the culture 

25 media are precipitated with an HA specific monoclonal antibody. Precipitated 
polypeptides are then analyzed by SDS-PAGE. 

Alternatively, DNA containing the hVR-1, hVR-2, and rVR-2 coding sequence 
is cloned directly into the polylinker of the pCDNA/Amp vector using the appropriate 
restriction sites. The resulting plasmid is transfected into COS cells in the manner 

30 described above, and the expression of the hVR-1, hVR-2, and rVR-2 polypeptide is 
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detected by radiolabelling and immunoprecipitation using an hVR-1, hVR-2, and rVR-2 
specific monoclonal antibody. 

EXAMPLE 4: ELECTROPHYSIOLOGICAL STUDIES OF VR2 

5 Human VR2 was functionally characterized in both HEK293 cells and Xenopus 

oocytes using electrophysiological methods. VR2 (in the pcDNA3.1 vector purchased 
by Invitrogen) was transiently expressed in HEK293 cells (ATCC) and recordings were 
performed 48 hours after transfection of cells using the whole-cell patch-clamp method 
(described in Bertil Hille, Ionin Channels of excitable membranes, 1992; Hammill et al. 

10 (1981) Pluger Arch. 391:85-100). The results indicate that heat stimulation (>50 °C) 
induces a rapid inactivating inward current (1-2 nA). Heat-evoked currents of VR2 
displayed profound desensitization and could be reversibly blocked by the VR1 inhibitor 
capsazepin (at a 10 ^tM concentration). In contrast to rat VR1, Capsaicin (at a 1-10 \xM 
concentration), resiniferatoxin (at a 0. 1-3 |iM concentration), and low pH (5.0-6.0) do 

15 not induce any currents from VR2. Binding studies of [ 3 H]-resiniferatoxin (NEN) to 
both human VR1 and VR2 in membranes isolated from HEK293 cell homogenates also 
indicate that resiniferatoxin (at a 0.1-10 nM concentration) has no specific binding to 
VR2 while it binds to human VR1 with high affinities. 

For the oocyte studies, human VR2 was subcloned into an oocyte expression 

20 vector containing 5'- and 3'-UTR of Xenopus p-globin (Chiara et al. (1999) 

Biochemistry 38(20)6689-6698). In vitro trasncription was carried out as described in 
Chiara et al {supra) and cRNA (10-100 ng) was then injected into the oocytes. VR2 
function was characterized in the oocytes 48 hours after cRNA injection using a standard 
two-electrode voltage-clamp. Consistent with the data from the HEK293 studies, VR2 

25 can only be activated by heat stimulation (48-50 °C) but not by vanilloid receptor 

agonists, capsaicin, or resiniferatoxin. The vanilloid receptor antagonist capsazepine (at 
a 1-10 fiM concentration) blocks the heat response of VR2 reversibly. 



BNSDOCID: <WO 0029577A1_L> 



WO 00/29577 



-96- 



PCT/US99/26701 



EXAMPLE 5: GENERATION OF ANTI-hVR-2 ANTIBODIES AND hVR-2 

PROTEIN LOCALIZATION BY IMMUNOSTAINING 

Polyclonal antisera were raised in rabbits against the following three peptides 
derived from the human VR2 amino acid sequence, using the techniques described in Ed 
5 Harlow and David Lane (1988) "Antibodies; A Laboratory Manual" Cold Spring harbor 
Laboratory Press. 

Antibody PEPTIDE 1 : AFHCKSPHRHRMVVLE (SEQ ID NO: 13) 
Antibody PEPTIDE 2: RPEAPTGPNATESVQPMEGQEDEGN (SEQ ID NO: 14) 
Antibody PEPTIDE 3: SVLEMENGYWWCRKKQRAG (SEQ ID NO: 15) 
! 0 Antisera were subsequently affinity purified using the peptide immunogen. 

The polyclonal antisera were tested for immunostaining of both monkey and rat 
dorsal root ganglion sensory neurons. Peptides 1 and 3 gave specific staining of 
subpopulations of sensory neurons that was competed with the corresponding peptide. 
This pattern of expression was very similar to the one observed using a VR-2 riboprobe. 

15 

EXAMPLE 6: CHROMOSOMAL LOCALIZATION OF hVR-1 AND 

hVR-2 

To chromosomally map the hVR-1 gene, primers were designed based on the 
sequence of hVR-1 (clone Fchrb87a6) (amplifying a 177 bp product from a human 

20 control cell line DNA and multiple faint larger products from a control Hamster cell line 
DNA by PCR). These primers were used to amplify 93 DNAs in duplicate from the 
Genebridge 4 Radiation Hybrid Panel (Research Genetics, Inc., Huntsville, AL). 

The hVR-1 primers used in the PCR mapping studies were: forward - 
TAGGAGACCCCGTTGCCACG (SEQ ID NO: 16) and reverse - 

25 GATTC ACTTGGGGACAGTGACG (SEQ ID NO: 1 7) and the PCR reactions were 
performed as follows: 5 nl Template DNA (lOng/^l), L5|il 10X Perkin Elmer PCR 
Buffer, 1.2jal Pharmacia dNTP mix 2.5 raM, 1.15fil Forward primer 6.6^iM, 1.15^1 
Reverse primer 6.6jaM, 5|al Gibco/BRL Platinum Taq .05U/^il (Hot Start), using an 
amplification profile of: 95°C for 10 minutes followed by 35 Cycles of 94°C for 40 

30 seconds, 55°C for 40 seconds, 72°C for 40 seconds, and 72°C for 5 minutes. The PCR 
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products were run on 2% agarose gels, post-stainedwith SYBR Gold (1 : 10,000 dilution 
in IX TBE), and scanned on a Molecular Dynamics 595 Fluorimager. 

The following is the vector data for the 93 Genebridge4 hybrid DNAs. These are 
in order 1-93. A "1" is a positive result, a is a negative result, a "?" is an ambiguous 
5 result. 

hVRl 1 - - 1 ? - 1 - 1 - 1 1 - - - 1 - - 1 - 1 1 - - 1 - 1 - 1 - - 1 1 - 1 i_i i i 

1 1 1 1 1 - 1 1 1 1-1-1 1 l-l 1 

10 RH linkage analysis was performed using the Map Manager QTb28 software package. 

hVRl was found to map to the p arm of human chromosome 17, 18.9 cR 3000 
telomeric to the Whitehead Institute framework marker WI-6584, and 7.7 cR 3000 
centromeric of the Whitehead framework marker WI-5436. LOD scores for linkage 
were 14.5 for WI-6584 and 19.3 for WI-5436. This region corresponds to the 

15 cytogenetic location 17pl2-13. This region is syntenic to mouse chromosome 1 1. 

To chromosomal ly map the hVR-2 gene, primers were designed from 5' UTR 
sequence of human VR2 (clone FIh21el 1) (amplifying a 166 bp product from a human 
control cell line DNA and 2 much larger faint bands from a control Hamster cell line 
DNA by PCR). These primers were used to amplify 93 DNAs in duplicate from the 

20 Genebridge 4 Radiation Hybrid Panel (Research Genetics, Inc., Huntsville, AL). 

The hVR-2 primers used in the PCR mapping studies were: forward - 
TTAAGCTCCCGTTCTCACCG (SEQ ID NO: 18) and reverse - 
GCTGCGGGAGGAAGTGAAGC (SEQ ID NO: 19) and the PCR reactions were 
performed as follows: 5[il Template DNA (10ng/nl), 1.5|il 10X Perkin Elmer PCR 

25 Buffer, 1 .2jil Pharmacia dNTP mix 2.5mM, 1 . 1 5jil Forward primer 6.6^M, 1 . 1 5^1 
Reverse primer 6.6fiM, 5\xl Gibco/BRL Platinum Taq .05U/jil (Hot Start), using an 
amplification profile of 95°C for 10 minutes, followed by 35 Cycles of 94°C for 40 
seconds, 55°C for 40 seconds, 72°C for 40 seconds, and 72°C for 5 minutes. The PCR 
products were run on 2% agarose gels, post-stainedwith SYBR Gold (1 : 10,000 dilution 

30 in IX TBE), and scanned on a Molecular Dynamics 595 Fluorimager. 
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The following is the vector data for the 93 Genebridge4 hybrid DNAs. These are 
in order 1-93. A "1" is a positive result, a is a negative result, a is an ambiguous 
result. 

5 hVR2 1 - - 1 1 - ? 1 1 1 - - 1 - 1 1 1 - 1 1 1 l - - 1 1 1 - 1 1 1 - - 1 

1 ... 1 i i i i ... i . i l-i-i 11 1 1 - -- 1 1 - 1 - 1 -?---? 

RH linkage analysis was performed using the Map Manager QTb28 software package. 

10 hVR2 was found to map to the p arm of human chromosome 1 7 , 29.3cR cR 3000 

telomeric to the Whitehead Institute framework marker D17S721, and 23.3 cR 3000 
centromeric of the Whitehead framework marker AFMA043ZB5. LOD scores for 
linkage were 1 1.9 for D17S721 and 13.6 for AFMA043ZB5. This region corresponds to 
the cytogenetic location 17pl 1-12. This region is syntenic to mouse chromosome 11. 

15 

Equivalents 

Those skilled in the art will recognize, or be able to ascertain using no more than 
routine experimentation, many equivalents to the specific embodiments of the invention 
described herein. Such equivalents are intended to be encompassed by the following 
20 claims. 
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What is claimed: 



1 . An isolated nucleic acid molecule selected from the group consisting of: 

(a) a nucleic acid molecule comprising the nucleotide sequence set 
5 forth in SEQ ID NO: 1 , 3, 4, 6, 7, 9, 1 0, or 1 2 or a complement thereof; and 

(b) a nucleic acid molecule consisting of the nucleotide sequence set 
forth in SEQ ID NO: 1, 3, 4, 6, 7, 9, 10, or 12 or a complement thereof. 

2. An isolated nucleic acid molecule which encodes a polypeptide selected 
1 0 from the group consisting of: 

(a) a polypeptide comprising the amino acid sequence set forth in 
SEQ ID NO:2, 5, 8, or 1 1 ; and 

(b) a polypeptide consisting of the amino acid sequence set forth in 
SEQ ID NO:2, 5, 8, or 11. 

15 

3. An isolated nucleic acid molecule which encodes a naturally occurring 
allelic variant of a polypeptide comprising the amino acid sequence set forth in SEQ ID 
NO:2, 5, 8, or 11. 



20 4. An isolated nucleic acid molecule selected from the group consisting of: 

a) a nucleic acid molecule comprising a nucleotide sequence which 
is at least 83% identical to the nucleotide sequence of SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 
12, or a complement thereof; 

b) a nucleic acid molecule comprising a fragment of at least 20 

25 nucleotides of a nucleic acid comprising the nucleotide sequence of SEQ ID NO:l, 3, 4, 
6, 7, 9, 10, or 12, or a complement thereof; 

c) a nucleic acid molecule which encodes a polypeptide comprising 
an amino acid sequence at least about 87% identical to the amino acid sequence of SEQ 
ID NO:2,5, 8, or 11; and 
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d) a nucleic acid molecule which encodes a fragment of a 
polypeptide comprising the amino acid sequence of SEQ ID NO:2, 5, 8, or 1 1, wherein 
the fragment comprises at least 15 contiguous amino acid residues of the amino acid 
sequence of SEQ ID NO:2, 5, 8, or 1 1. 

5 

5. An isolated nucleic acid molecule comprising the nucleic acid molecule 
of any one of claims 1, 2, 3, or 4, and a nucleotide sequence encoding a heterologous 
polypeptide. 

10 6. A vector comprising the nucleic acid molecule of any one of claims 1, 2, 

3, or 4. 

7. The vector of claim 6, which is an expression vector. 

15 8. A host cell transfected with the expression vector of claim 7. 

9. A method of expressing a polypeptide comprising culturing the host cell 
of claim 8 in an appropriate culture medium to, thereby, express the polypeptide. 
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10. An isolated polypeptide selected from the group consisting of: 

a) a fragment of a polypeptide comprising the amino acid sequence 
of SEQ ID NO:2, 5, 8, or 1 1, wherein the fragment comprises at least 15 contiguous 
amino acids of SEQ ID NO:2, 5, 8, or 1 1; 

5 b) a naturally occurring allelic variant of a polypeptide comprising 

the amino acid sequence of SEQ ID NO:2, 5, 8, or 1 1 , wherein the polypeptide is 
encoded by a nucleic acid molecule which hybridizes to a nucleic acid molecule 
consisting of SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12 under stringent conditions; 

c) a polypeptide which is encoded by a nucleic acid molecule 
10 comprising a nucleotide sequence which is at least 83% identical to a nucleic acid 

comprising the nucleotide sequence of SEQ ID NO:l, 3, 4, 6, 7, 9, 10, or 12; and 

d) a polypeptide comprising an amino acid sequence which is at least 
87% identical to the amino acid sequence of SEQ ID NO:2, 5, 8, or 1 1 . 

15 11. The isolated polypeptide of claim 1 0 comprising the amino acid sequence 

of SEQ ID NO:2,5, 8, or 11. 

12. The polypeptide of claim 10, further comprising heterologous amino acid 
sequences. 

20 

13. An antibody which selectively binds to a polypeptide of claim 10. 

14. A method for detecting the presence of a polypeptide of claim 10 in a 
sample comprising: 

25 a) contacting the sample with a compound which selectively binds to 

the polypeptide; and 

b) determining whether the compound binds to the polypeptide in 
the sample to thereby detect the presence of a polypeptide of claim 10 in the sample. 

30 15. The method of claim 14, wherein the compound which binds to the 

polypeptide is an antibody. 
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16. A kit comprising a compound which selectively binds to a polypeptide of 
claim 1 0 and instructions for use. 

5 1 7. A method for detecting the presence of a nucleic acid molecule of any 

one of claims 1, 2, 3, or 4 in a sample comprising: 

a) contacting the sample with a nucleic acid probe or primer which 
selectively hybridizes to the nucleic acid molecule; and 

b) determining whether the nucleic acid probe or primer binds to a 
10 nucleic acid molecule in the sample to thereby detect the presence of a nucleic acid 

molecule of any one of claims 1, 2, 3, or 4 in the sample. 

18. The method of claim 17, wherein the sample comprises mRNA 
molecules and is contacted with a nucleic acid probe. 

15 

19. A kit comprising a compound which selectively hybridizes to a nucleic 
acid molecule of any one of claims 1, 2, 3, or 4 and instructions for use. 

20. A method for identifying a compound which binds to a polypeptide of 
20 claim 10 comprising: 

a) contacting the polypeptide, or a cell expressing the polypeptide 
with a test compound; and 

b) determining whether the polypeptide binds to the test compound. 

25 21. The method of claim 20, wherein the binding of the test compound to the 

polypeptide is detected by a method selected from the group consisting of: 

a) detection of binding by direct detection of test 
compound/polypeptide binding; 

b) detection of binding using a competition binding assay; and 

30 c) detection of binding using an assay for hVR- 1 , hVR-2, or rVR-2 

activity. 
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22. A method for modulating the activity of a polypeptide of claim 10 
comprising contacting the polypeptide or a cell expressing the polypeptide with a 
compound which binds to the polypeptide in a sufficient concentration to modulate the 
activity of the polypeptide. 



23. A method for identifying a compound which modulates the activity of a 
polypeptide of claim 10 comprising: 



1 0 polypeptide to thereby identify a compound which modulates the activity of the 
polypeptide. 

24. A method for treating a subject having a disorder characterized by 
aberrant hVR-1 or hVR-2 protein activity or nucleic acid expression comprising 

15 administering to the subject a hVR-1 or hVR-2 modulator such that treatment of the 
subject occurs. 

25. The method of claim 24, wherein the hVR-1 or hVR-2 modulator is a 
small molecule. 

20 



5 



a) 
b) 



contacting a polypeptide of claim 10 with a test compound; and 
determining the effect of the test compound on the activity of the 



26. 



The method of claim 24, wherein the disorder is a pain disorder. 
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V35 

human VRi gene with translation of open reading frame 

Input file Fchrb87a6 .eeq; Output File Fchrb87a6 . tra 
Sequence length 3909 

GTGAGCG CAACGCACTGCGGG CAGTG AGCG CAACGCACTGCGGGCAGTGAGCGCAACGCACTGCGGGCAG TGAGCGCAA 
CGCACTGCGGGCAGTGAGCGCAACGCACTGCGGGC^C^ 

GCAG TG AG CGCAACG CA CTTG CGGG CAG TG AG CG CAACGCACTGCGGG CAG TG AG CGCAA CG CACTGCGGG CAGTG AGC 

GC^IACGCACTGCGGGCAGTGAGCGCAACGCACTG 

GKX3GGCAGTGAGCGCAACGCACTGCGGGCAGTGAGCGCAA 

AGCGCAACGCACTGCGGGCAGTGAGCGCAACGCACTTAATGTGAGTTAGCTCACTCATTAGGCACCCCAGGCTTTACAC 
TTTATGCTTCCG GCTCG TATGTTG TGTGG AATTGTG AG CGG ATAACAATTTCACACAGGAAACAGCTATG ACCATGATT 
ACGCCAAGCTCTTAATACGACTC^CTATAGGGAAAGCTGGTACGCCTG CAGG TACCGGTCCGGAATTCCCGGGTCG ACCC 
ACX3CX?TCtt3AAAACACACCTCTCTG 

GTGCTCTGGGGAGAATTC<3TAGATCATCCTCAGAAAAGCCn*^^ 
CX^GAAAAGCTGTCCACAGTAGTCCCCCCTTATCCACGGG 
GGTCTGCCAATATTAAATGGAAAATTCTTCAAAGAGT^ 
AGAGTCTCTGCCGTGCCATCTGGGATGCJ^ 

AGTXIACTTAGTCGTCAG ATCGCCCGTCCTGGTATCA CACACTGGG CCACAGAGGATCCA 

MKKHSS TDLGTAADPL.QK 18 
GGAAGG ATG AAG AAA TGG AGC AGC ACA GAC TTG GGG ACA GCT GCG GAC CCA CTC CAA AAG 54 

DTCPDPLDGDPNSRPPPAKP 38 
OAC ACC TGC CCA GAC CCC CTG GAT GGA GAC CCT AAC TCC AGG CCA CCT CCA GCC AAG CCC 114 

QLPTAKSRTRLFGKGDSEEA 58 
CAG CTC CCC ACG GCC AAG AGC CGC ACC CGG CTC TTT GGG AAG GOT GAC TCG GAG GAG GCT 174 

FPVDCPH BEGELDSCPTITV 78 
TTC CCG GTG GAT TGC CCC CAC GAG GAA GGT GAG TTG GAC TCC TGC CCG ACC ATC ACA GTC 234 

£ PV I TI Q R P GDG PTGARI«IiS' 98 
AGC CCT GTT ATC ACC ATC CAG AGG CCA GGA GAC GGC CCC ACC GGT GCC AGG CTG CTG TCC 294 

QDS'VAASTBK TLRLYDRRSI 
CAG GAC TCT GTC GCC GCC AGC ACC GAG AAG ACC CTC AGG CTC TAT GAT CGC AGG AGT ATC 354 

F E A V A O N N C Q D L E S L L L F L Q 138 
TTT GAA GCC GTT GCT CAG AAT AAC TGC CAG GAT CTG GAG AGC CTG CTG CTC TTC CTG CAG 414 

FIGURE 1 
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KSKKHLTDNEFKDPETGKTC 1S8 
AAG AGC AAG AAG CAC CTC ACA GAC AAC GAG TTC AAA GAC CCT GAG ACA GGG AAG ACC TGT 4 74 



LLKAMLNLHDG 
CTC CTG AAA GCC ATG CTC AAC CTG CAC GAC GGA 

EIARQTDSLKE 
GAG ATC GCG CGG CAA ACG GAC AGC CTG AAG GAG 

YYKGQTALHIA 
TAC TAC AAG GGC CAG ACA GCA CTG CAC ATC GCC 

TLLVENGADVQ 
ACC CTC CTG GTG GAG AAC GGA GCA GAC GTC CAG 

KTKGR PGFYFG 
AAA ACC AAA GGG CGG CCT GGA TTC TAC TTC GGT 

TNOLGIVKFLL 
ACC AAC CAG CTG GGC ATC GTG AAG TTC CTG CTG 

SARDSVGNTVL 
AGC GCC AGG GAC TCG GTG GGC AAC ACG GTG CTG 

TADNTKFVTSM 
ACG GCC GAC AAC ACG AAG TTT GTG ACG AGC ATG 

KLHPTLKLEEI* 
AAA CTG CAC CCG ACG CTG AAG CTG GAG GAG CTC 

ALAAGTGK IGV 
GCT CTG GCA GCT GGG ACC GGG AAG ATC GGG GTC 

QEPECRHLSRK 
CAG GAG CCC GAG TGC AGG CAC CTG TCC AGG AAG 

HSSLYDXjS CID 
CAC TCC TCG CTG TAC GAC CTG TCC TGC ATC "GAC 

VIAYSfiSETPN 
GTG ATC GCC TAC AGC AGC AGC GAG ACC CCT AAT 

IfKRI*LQDKWDR 
CTG AAC CGA CTC CTG CAG GAC AAG TGG GAC AGA 

FLVYCIiYMIIF 
TTC CTQ GTC- TAC TGC CTG TAC ATG ATC ATC TTC 

DGLPPFKMEKI 
GAT GGC TTG CCT CCC TTT AAG ATG GAA AAA ATT 

ILSVLGGVYFF 
ATC CTG TCT GTG TTA GGA GGA GTC TAC TTC TTT 



QNTTIPLLL 178 

CAG AAC ACC ACC ATC CCC CTG CTC CTG 534 

LVNASYTD S 198 

CTT GTC AAC GCC AGC TAC ACG GAC AGC 594 

IERRNMALV 218 

ATC GAG AGA CGC AAC ATG GCC CTG GTG 6 54 

AAAHGDFFK 238 

GCT GCG GCC CAT GGG GAC TTC TTT AAG 714 

ELPLSLAAC 258 

GAA CTG CCC CTG TCC CTG GCC GCG TGC 7 74 

QNS WQ TADI 278 

CAG AAC TCC TGG CAG ACG GCC GAC ATC 834 

HALVEVADN 298 

CAC GCC CTG GTG GAG GTG GCC GAC AAC 894 

YNE I L M L G A 318 

TAC AAT GAG ATT CTG ATG CTG GGG GCC .954 

TN KKGMTPL 338 

ACC AAC AAG AAG GGA ATG ACG CCG CTG 1014 

LAYILQREI 358 

TTG GCC TAT ATT CTC CAG CGG GAG ATC 1074 

FT E WAYGPV 378 

TTC ACC GAG TGG GCC TAC GGG CCC GTG 1134 

TCEKNSVLB 398 

ACC TGC GAG AAG AAC TCG GTG CTG GAG 1194 

RHDMLLVBP 418 

CGC CAC GAC ATG CTC TTG GTG GAG CCG 1254 

FVKRIFYFN 438 

TTC GTC AAG CGC ATC TTC TAC TTC AAC 1314 

TMAAYYRPV 458 

ACC ATG GCT GCC TAC TAC AGG CCC GTG 1374 

GOYFRVTGE 478 

GGA GAC TAT TTC CGA GTT ACT GGA GAG 1434 

FRG IQYFLQ 498 

TTC CGA GGG ATT CAG TAT TTC CTG CAG 1494 
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RRPSMKTLFVD 
AGG CGG COG TCG ATG AAG ACC CTG TTT GTG GAC 

Q S LFMLATVVL 
CAG TCA CTG TTC ATG CTG GCC ACC GTG GTG CTG 

ASMVFSLALGW 
GCT TCC ATG GTA TTC TCC CTG GCC TTG GGC TGG 

FQQMG IYAVMI 
TTC CAG CAG ATG GGC ATC TAT GCC GTC ATG ATA 

RFMFVYIVFLF 
CGT TTC ATG TTT GTC TAC ATC GTC TTC TTG TTC 

I EDGKNDSLPS 
ATT GAA GAC GGG AAG AAT GAC TCC CTG CCG TCT 

PACRPPDSSYN 
CCT GCC TGC AGG CCC CCC GAT AGC TCC TAC AAC 

FKFTIGMGDLE 
TTC AAG TTC ACC ATC GGC ATG GGC GAC CTG GAG 

VFII L L L» A Y V I 
GTC TTC ATC ATC CTG CTG CTG GCC TAT GTA ATT 

LIALMGETVNK 
CTC ATC GCC CTC ATG GGT GAG ACT GTC AAC AAG 

KLQR AITILDT 
AAG CTG CAG AGA GCC ATC ACC ATC CTG GAC ACG 

KAFRSGKLLQV 
AAG GCC TTC CGC TCA GGC AAG CTG CTG CAG GTG 

YRWCFRVDEVN 
TAC CGG TGG TGC TTC AGG GTG GAC GAG GTG AAC 

I INEDPGHC BG 
ATC ATC AAC GAA GAC CCG GGC AAC TOT GAG GGC 

RSSRVSGRHWK 
CGG TCA AGC AGA GOT TCA GGC AGA CAC TGG AAG 

EASARDRQSAQ 
GAG GCA AGT GCT CGA GAT AGG CAG TCT GCT CAG 

SGSItKPSDABV 

TCA GGG TCT CTG AAG CCA GAG GAC GCT GAG GTC 

K * 

AAG TGA 



SYSEMLFFL 5X8 

AGC TAC AGT GAG ATG CTT TTC TTT CTG 1554 

YFSHLKEYV 538 

TAC TTC AGC CAC CTC AAG GAG TAT GTG 1614 

TNMLYYTRG 558 

ACC AAC ATG CTC TAC TAC ACC CGC GGT 1674 

EKMILRDLC 578 

GAG AAG ATG ATC CTG AGA GAC CTG TGC 1734 

G FSTAVVTL 598 

GGG TTT TCC ACA GCG GTG GTG ACG CTG 1794 

ESTSHRWRG 618 

GAG TCC ACG TCG CAC AGG TGG CGG GGG 18 54 

SLYSTCLEL 638 

AGC CTG TAC TCC ACC TGC CTG GAG CTG 1914 

FTENYDFKA 658 

TTC ACT GAG AAC TAT GAC TTC AAG GCT 1974 

LTYI LLLNM 678 

CTC ACC TAC ATC CTC CTG CTC AAC ATG 2034 

IAQESKNIW 698 

ATC GCA CAG GAG AGC AAG AAC ATC TGG 2094 

EKSFLKCMR 718 

GAG AAG AGC TTC CTT AAG TGC ATG AGG 2154 

GYTPDGKDD 738 

GGG TAC ACA CCT GAT GGC AAG GAC GAC 2214 

WTTWKTNVG 758 

TGG ACC ACC TGG AAC ACC AAC GTG GGC 2274 

VKRTLSFSli 778 

GTC AAG CGC ACC CTG AGC TTC TCC CTG 2334 

NFALVPLLR 798 

AAC TTT GCC CTG GTC CCC CTT TTA AGA 2394 

PEEVYLRQF 818 

CCC GAG GAA GTT TAT CTG CGA CAG TTT 2454 

FKSPAAfiGE 638 

TTC AAG AGT CCT GCC GCT TCC GGG GAG 2514 

840 
2520 
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Pull -length human VR2 

Input flic Flh21ell.eeq; Output File Flh21ell.tra 
Sequence length 2809 

oo<rrAoccTOTCCTOAawoqQOAGA^ 

AQAGACaVGAACCTGCI'lXXriXXSAQCTTAGTGC^ 

GGCAGCCCCTCCCGGCTTCACTTCXriXX^ 

GOCCTCAGCCTCCGGGGCTCCAGTCAGGC^^ 

MTSPSSSP 6 

TGCACAGJ\GGTC^7IX^3<7n3GACOGAGCAGCC^ ATG ACC TCA CCC TCC AGC TCT CCA 24 

VFRLETLDGGQEDGSEADRG 28 

GTT TTC AOG TTG GAG ACA TTA GAT GGA GGC CAA GAA GAT GGC TCT GAG GOG GAC AGA GGA 84 

KLDFGSGLPPMESQFQGEDR 48 

AAG CTG GAT TTT GGG AGC GGG CTG CCT CCC ATG GAG TCA CAG TTC CAG GGC GAG GAC CGG 144 

KFAPQI R V N !• N Y R KGTGASO 68 

AAA TTC GCC CCT CAG ATA AGA GTC AAC CTC AAC TAC CGA AAG GGA ACA GGT GCC AGT CAG 204 

PDPNRFDRDRLFNAVSR GVP 68 

COG CAT OCA AAC CGA TTT GAC CGA GAT CGG CTC TTC AAT GOG GTC TCC CGG GGT GTC CCC 264 

EDLAGLPE YLSKTSKYLTDS 106 

GAG GAT CTG GCT GGA CTT CCA GAG TAC CTG AGC AAG ACC AGC AAG TAC CTC ACC GAC TOG 324 

BYTEGSTG KTCLMKAVI.NX.K 128 

GAA TAC ACA GAG GGC TCC ACA GGT. AAG AOG TGC CTG ATG AAG GCT GTG CTG AAC CTT AAG 384 

DG VNAC I I* P L L Q I D R D S GK P 148 

GAC GGA GTC AAT GCC TGC ATT CTG CCA CTG CTG CAG ATC GAC AGG GAC TCT GGC AAT CCT 444 

Q P LVNAQ C T DD Y Y R G H S A L H 168 

CAG CCC CTG GTA AAT GCC CAG TGC ACA GAT GAC TAT TAC CGA GGC CAC AGC GCT CTG CAC 504 

IA I K K R S !• O CVKI* L V E N G A N 188 

ATC GCC ATT GAG AAG AGG AGT CTG CAG TGT GTG AAG CTC CTG GTG GAG AAT GGG GCC AAT. 564 

V H A R A C_ OR PPOKGQGTCFYF 208 

GTG CAT GCC CGG GCC TGC GGC CGC TTC TTC CAG AAG GGC CAA GGG ACT TGC TTT TAT TTC 624 

O B I»PX.SX.A ACT'ieQ.WDV VGTI- 228 

GGT GAG CTA CCC CTC TCT TTG GCC OCT TGC ACC AAG GAG TOG GAT GTG GTA AGC TAC CTC 684 

LBHPBQPAfiLQA TDSQGNT V 248 

CTG GAG AAC CCA CAC CAG CCC GCC AGC CTG GAG GCC ACT GAC TCC CAG GGC AAC ACA GTC 744 

L8A LVKX6 DH& ABH XAItVTS 268 

CTG CAT GCC CTA GTG ATG ATC TCO GAC AAC TCA GCT GAG AAC ATT GGA CTG GTG ACC AGC 604 

M YDGZtXfQAGARZ«CPTVQX*BD. 288 

ATG TAT GAT GOG CTC CTC CAA GCT GGG GCC CGC CTC TGC CCT ACC GTG GAG CTT GAG GAC 664 

XRHLQDLT PLKbAAKBGKIE 308 
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ATC CGC AAC CTO CAQ OAT CTC ACG CCT CTG AAG CTO OCC OOC AAO GAO GOC AAG ATC GAG 924 

IFRHILQRBFSO LSHLSRKF 326 

ATT TTC AGG CAC ATC CTG GAG CGG GAG TTT TCA GGA CTG AGC CAC CTT TCC OGA AAG TTC 984 

TEWCVGPVR VSLYDLASVDS 346 

ACC GAG TOO TOC TAT GGG CCT GTC COG CTG TOG CTO TAT GAC CTG OCT TCT GTG GAC AGC 1044 

CEEKSVLEIIAFHCKSPHRH 366 

TGT GAG GAG AAC TCA GTG CTG GAG ATC ATT GCC TTT CAT TGC AAG AGC COG CAC CGA CAC 1104 

RMVVLEPLHKLI-QAKWOLLI 388 

CGA ATG GTC GTT TTG GAG OCC CTG AAC AAA CTG CTG CAG GCC AAA TOG GAT CTG CTC ATC 1164 

PKFFI-NF LCNI,I*MFIFTAV 408 

CCC AAG TTC TTC TTA AAC TTC CTG TGT AAT CTG ATC TAC ATG TTC ATC TTC ACC GCT GTT 1224 

A YHQPTLKKQAAPHLKAEVG 428 

GCC TAC CAT CAG CCT ACC CTG AAG AAG CAG GCC GCC CCT CAC CTG AAA GCG GAG GTT GGA 1284 

KSMLLTGH I LI L L G G I Y L» L V 448 

AAC TCC ATG CTG CTG ACG GGC CAC ATC CTT ATC CTG CTA GGG GGG ATC TAC CTC CTC GTG 1344 

GQLWYFWRRHVFIWISFIDS 468 

GGC CAG CTG TGG TAC TTC TGG CGG CGC CAC GTG TTC ATC TGG ATC TOG TTC. ATA GAC AGC 1404 

TPB Xl.FI.rQAI.I.TVV. -"SQV1.C 488 

TAC TTT GAA ATC CTC TTC CTG TTC CAG GCC CTG CTC ACA GTG GTG TCC CAG GTG CTG TGT 1464 

-iAIEW*I.*X-I.VSAI.VIiGWI. 508 

TTC CTG GCC ATC GAG TGG TAC CTG CCC CTG CTT GTG TCT GOG -CTG GTG CTG GGC TOG CTG 1524 

MI.I,YYTRGFQHTGIYSVMIQ 528 

AAC CTG CTT TAC TAT ACA OGT GGC TTC CAG CAC ACA GGC ATC TAC ACT GTC ATG ATC CAG 1584 

,rviI.RDl,I-RFX'I' lYI ' VFIiPG 548 

AAG GTC ATC CTG CGG GAC CTG CTG CGC TTC CTT CTG ATC TAC TTA GTC TTC CTT TTC GGC 1644 

FAVALVS1-SQBAWRPEAPTG 568 

TTC OCT GTA GCC CTG GTG AGC CTG AGC CAG GAG GCT TGG CGC CCC GAA GCT CCT ACA GGC 1704 

»»*TEfiVGPMBGQBDEGHGA 588 

cccaLgc^acagLtcagto 1764 

CAG TAC AGO OGT ATC CTO GAA GOC TCC TTG GAG CTC TTC AAA TTC ACC ATC GOC ATG GGC 1624 

»wt.I.*YXl.I.I.»-"' 1 ' XA, ' HSBT 646 

W^mW.LlL^OTCIOCK^mCTC^^CKMO^™^ 1944 

TVO TK PD08P08R« CFRVBE 708 

FI GURE 2 (continued) 
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Partial human VR2 alternate form 

Input file f rhobl2c4 .eeq; Output Flic frhobl2c4 . tra 
Sequence length 1469 

ORFFQKGQGTCFYFGSL'FL 
GC GGC CGC TTC TTC CAG AAG GGC CAA OGG ACT TGC TTT TAT TTC GOT GAG CTA CCC CTC 

S L A A C T K Q HDVVS Y L L E N P H 
TCT TTC GCC GCT TGC ACC AAG CAG TGG GAT GTG GTA AGC TAC CTC CTG GAG AAC CCA CAC 



19 
57 



39 
117 



QPASLQATDSQGNTVLHALV 59 
CAG CCC GCC AGC CTG CAG GCC ACT GAC TCC CAG GGC AAC ACA GTC CTG CAT GCC CTA GTG 177 

M I S DHSA E N I A L V TS M Y DG L 7$ 
ATG ATC TOG GAC AAC TCA GCT GAG AAC ATT GCA CTG GTG ACC AGC ATG TAT GAT GGG CTC 237 

LOAGARLCPTVQLEDIRN LO 99 

CTC CAA GCT GGG GCC CGC CTC TGC CCT ACC GTG CAG CTT GAG GAC ATC CGC AAC CTG CAG 297 

DLTPLKLAAK.EGKIEIFRHI n 9 

GAT CTC ACG CCT CTG AAG CTG GCC GCC AAG GAG GGC AAG ATC GAG ATT TTC AGG CAC ATC 357 

L Q R E F S G L S H L S R KFTEWCY 139 

CTG CAG CGG GAG TTT TCA GGA CTG AGC CAC CTT TCC CGA AAG TTC ACC GAG TGG TGC TAT 417 

GPVRVSLYDL ASVDSCEENS 159 

GGG CCT GTC CGG GTG TOG CTG TAT GAC CTG GCT TCT GTG GAC AGC TGT GAG GAG AAC TCA 477 

VLSI IAFHCKSPHRKRMVVL 179 

GTG CTG GAG ATC ATT GCC TTT CAT TGC AAG AGC COG CAC CGA CAC CGA ATG GTC GTT TTG 537 

E P L N K L L Q A K HO 1* LI P KFF L 199 

GAG CCC CTG AAC AAA CTG CTG CAG GOG AAA TGG GAT CTG CTC ATC CCC AAG TTC TTC TTA S97 

HFLCHLZ YMFIFTAVAYHQP 219 

AAC TTC CTG TGT AAT CTG ATC TAC ATG TTC ATC TTC ACC GCT GTT GCC TAC CAT CAG CCT 6S7 

T X* . K K Q A A P H I* K A E VGN S ML L 239 

ACC CTG AAG AAG CAG GCC GCC CCT CAC CTG AAA GOG GAG GTT GGA AAC TCC ATG CTG CTG 717 

TGHXLXLLGGXYLLVG QLKY 259 

ACG GGC CAC ATC CTT ATC CTG CTA GGG GGG ATC TAC CTC CTC GTG GGC CAG CTG TGG TAC 777 

~F~ WR-RBVF X.ff X CF X D6 < Y FBI 279" 

TTC TGG CGG COC GAC GTG TTC ATC TGG ATC TOO TTC ATA GAC AGC TAC TTT GAA ATC CTC 637 

FLFQAIiLTVVfiQV I.CFI. AIB 299 

TTC CTG TTC CAG GCC CTG CTC ACA GTG GTG TCC GAG GTG CTG TGT TTC CTG GCC ATC GAG 897 

MYLPLL VfiALVLGHLHLLYY 319 - 

TOG TAC CTG CCC CTG CTT GTG TCT GOG CTG GTG CTG GGC TGG CTG AAC CTG CTT TAC TAT 957 

T RGFQBTG X Y 6 V H X QKK AX B 339 
ACA COT GGC TTC GAG CAC ACA GGC ATC TAC AGT GTC ATG ATC CAG AAG AAA GCC ATC TCT 10X7 

V LBKBHOYHWCRICKQRAGVH 359 
GTC CTG GAG ATG GAG AAT GGC TAT TGG TGG TGC AGO AAG AAG GAG COG GCA GOT GTG ATG 1077 
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Partial rat VR2 

Input file Plrxbl4 7gll.8eq; Output File Flrxbl47gll . tra 
Sequence length 1794 

STHASALS I#AACTKQWDVV 19 

G TOO ACC CAC GCG TCC GCT CTT TCT CTG OCT GCG TGC ACC AAG CAG TOG GAT GTG GTG 57 

TYLLENP HQ PAS LEATDSLG 39 

ACC TAC CTC CTG GAG AAC CCA CAC CAG CCG GCC AGC CTG GAG GCC ACC GAC TCC CTG GGC 117 

HTVLHALVMIADNSPENSAL 59 

AAC ACA GTC CTG CAT GCT CTG GTA ATG ATT GCA GAT AAC TOG CCT GAG AAC AGT GCC CTG 177 

VIHMYDGLLQMGARLCPTVO 79 

GTG ATC CAC ATG TAC GAC GGG CTT CTA CAA ATG GGG GOG CGC CTC TGC CCC ACT GTG CAG 237 

LEEISNHQGLTPLKLAAKEG 99 

CTT GAG GAA ATC TCC AAC CAC CAA GGC CTC ACA CCC CTG AAA CTA GCC GCC AAG GAA GGC 297 

K I E I FRH I t* Q R E FSGPYQPL 119 

AAA ATC GAG ATT TTC AGG CAC ATT CTG CAG CGG GAA TTC TCA GGA CCG TAC CAG CCC CTT 357 

SRKFTEMCYGPVRVSLYDLS 139 

TCC CGA AAG TTT ACT GAG TGG TGT TAC GGT CCT GTG OGG GTA TOG CTG TAC GAC CTG TCC 417 

SVDSWEK N S V I* E I IAFHCKS 159 

TCT GTG GAC AGC TGG GAA AAG AAC TOG GTG CTG GAG ATC ATC GCT TTT CAT TGC AAG AGC 477 

PNRHRMVVLEPX-HKLLQEKW 179 

COG AAC OGG CAC CGC ATG GTG GTT TTA GAA CCA CTG AAC AAG CTT CTG CAG GAG AAA TGG 537 

DR LVSRFFFKFACYLVYMFI 199 

OAT OGG CTC GTC TCA AGA TTC TTC TTC" AAC TTC GCC TGC TAC TTG GTC TAC ATG TTC ATC S97 

FTVVAYHQ P E I* D Q PAI PSSK 219 

- TTC ACC GTC GTT GCC TAC CAC CAG CCT TCC CTG GAT CAG CCA GCC ATC CCC TCA TCA AAA 657 

ATFGSSMLX.I.GHXI..XI.LGGI 239 

OCG ACT TTT GGG GAA TCC ATG CTG CTG CTG GGC CAC ATT CTG ATC CTG CTT GGG GGT ATT 717 

YI-I.I.GQLWTFWRRRLFIWXS 259 

TAC CTC TTA CTG GGC CAG CTG TOG TAC TTT TOG CGG COG CGC CTG TTT ATC TGG ATC TCA 777 

F M"~T> 6 ~ Y" ~ F ~ B X !• F 1* *» O A I* I* T V I* & 279 

TTC ATG GAC AGC TAC TTT GAA ATC CTC TTT CTC CTT CAG OCT CTO CTC ACA GTG CTG TCC 837 

OVI.RFMBTBWYI. PI. I.VX.SI.V 299 

CAG GTG CTG CGC TTC ATG GAG ACT GAA TOO TAC CTA CCC CTG CTA GTG TTA TCC CTA GTG 897 

LGMI.KLLYTTROF0HTGIY 6 319 

CTG GGC TOG CTG AAC CTG CTT TAC TAC ACA CGG GGC TTT GAG CAC ACA GGC ATC TAC AGT 9S7 

VMXQKVXI.RDI.X'RFLI.VYLV 339 

OTC ATG ATC CAG AAG GTC ATC CTT CGA GAC CTG CTC COT TTC CTG CTO GTC TAC CTG GTC 1017 

F&FGFAVAI.V61.GRSARSPK 3B * 

TTC CTT TTC GGC TTT GCT GTA GCC CTA GTA AGC TTG AGC AGA GAG GCC CGA AGT CCC AAA 1077 
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GAP of: humanvr2.pep check: 5746 from: 1 to: 764 
humanVR2 Flh21ell 

to: humanvrl.pep check: 6877 from: 1 to: 839 
humanVRl _Fbhl8547pat - fchrb87a6, 3909 bases, 4554 checksum. 

Symbol comparison table: 
/ddm_local/gcg/gcg_9. l/gcgcore/data/rundata/blosum62 . cmp 

CompCheck: 6430 

Gap Weight: 12 Average Match: 2.912 

Length Weight: 4 Average Mismatch: -2.003 

Quality: 1530 Length: 850 

Ratio: 2.003 Gaps: 10 

Percent Similarity: 55.378 Percent Identity: 46.348 

Match display thresholds for the alignment ( s ) : 
| = IDENTITY 
: = 2 
. = 1 

humanvr2 .pep x humanvrl.pep 

n MTSPSSSPVF 10 

I. I . .1 

1 MKKWSSTDLGTAADPLQKDTCPDPLDGDPNSRPPPAKPQLPTAKSRTRLF 50 

11 RLETLDGGQEDGSEADRGKLDFGSGLPPMESQFQGEDRKFAPQIRVNLNY 60 

: : I . 1 I : • : • 1 - 

51 GKGDSEEAFPVDCPHEEGELDSCPTI. TVSPVITIQRPGDGPTGARLLSQ 99 

61 RKGTGASQPDPNRFDRDRLFNAVSRGVPEDLAGLPEYLSKTSKYLTDSEY 110 

:ll :| II.. -II I :l I. I:lll.|: 
100 DSVAASTEKTLRLYDRRSIFEAVAQNNCQDLESLLLFLQKSKKHLTDNEF 14 9 

Ill TEGSTGKTCLMKAVLNLKDGVNACI LPLLQI DRDSGNPQPLVNAQCTDDY 160 

: 111111:11.111 II I I M:l I • • • MM M I 
150 KDPETGKTCLL'KAMLNLH DGQNTT I PLLLE I ARQTDSLKELVNAS YT DS Y 199 

161 YRGHSTO^I AIEKRSUK^^LVENGANVHARACGRFFQKGQG . TCFYFG 209 

|:| I IIMMI.I I I I M.I .1 MM 

200 YKGQTALHIAIERRNMALV-TLLVENGADVQAAAHGDFFKKTKGRPGFYFG 24 9 

210 ELPLS IAACTKQWDWS YLLEN P HQPASLQAT DSQGNTVLHALVMI S DNS 259 

|| || | :| I I : 1 M I I MM Ml :.M. 

250 ELPLSLAACTNQLGIVKFLLQNSWQTADISARDSVGNTVLHALVEVADNT 299 

260 AENIAIjVTS^DGLLQAGARI^PTVQLEDIRNLQDLTPLiKLA^ 309 

|:| Mill. M MM II..M:: I . MM. Ill Ml : 
300 ADNTKFVTSMYNEI LMI/3AKLHPTLKLEELTNKKGMTPI*AI*AAGTGKIGV 349 
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310 FRHILQREFS. . GLSHLSRKFTEWCYGPVRVSLYDIASVDSCEENSVLEI 357 

: I M 1 I I I I I I I I I I I I I I Mill- : I • I I • I I I M : 

350 LAYI LQREIQEPECRHLSRKFTEWAYGPVHSSLYDLSCI DTCEKNS VLEV 399 

358 IAF. HCKSPHRHRMWLEPLNKLLQAKWDLLIPK. FFLNFLCNLI YMFIF 405 

||: I . I I I ... I I I I : I I I I I I : : I : I N Hill 

400 IAYSSSETPNRHDMLLVEPLNRLLQDKWDRFVKRI FYFNFLVYCLYMI I F 4 49 

406 TAVAYHQPTLKKQAAPHLKAEVGNSMLLTGHILILLGGI YLLVGQLWYFW 455 

| | | : . I I - : I . .1111.111:1 : M 

450 TMAAYYRPV. .DGLPPFKMEKIGDYFRVTGEILSVLGGVYFFFRGIQYFL 497 

4 56 RRH VFI WI S FI DS Y FEI LFLFQALLT WSQVLC FLAI EW YLPLLVSALVL 505 

.1 . |:IMI.II LI -.Ml : . I - : I • I I 
4 98 QRRPSMKTLFVDSYSEMLFFLQSLFMLATVVLYFSHLKEYVASMVFSLAL 547 

• • * * 

506 GWLNLLYYTRG FQHTG I YS VMI QKV I LRDLLRFLL I YLVFLFGFAVALVS 555 

|| |:|||||||| 111-111:1.11111 M: : I : I I I I I I . I . I . 
548 GWTNMLYYTRGFQQMGIYAVMIEKMILRDLCRFMFVYIVFLFGFSTAVVT 597 

» • * * * 

556 LSQEAWRPEAPTGPNATESVQPMEGQEDEGNGAQYRGILEASLELFKFTI 605 

I . . |. . | | . I : Ml III II 

598 LIEDGKNDSLPSESTSHRWRGPACRPPD SSYNSJ/YSTCLELFKFTI 64 3 



606 GMGELAFQEQLHFRGMVLLLLLAYVLLTYILLLNMLIALMSETVNSVATD 655 

|||:| | | | : - :: I I I I I I : I 1 I I I I I I II II I I MM : I : 
GMGDLEFTENYDFKAVFIILLLAYVILTYILLLNMLIALMGETVKKIAQE 



644 



693 



656 SWSIWKLQKAISVLEMENGYWWC.RKKQRAGVMLTVGTKPDGSPDERWCF 704 

I .11111:11. :l: I : IN LI M M III "MM 
694 SKNIWKLQRAITILDTEKS^KCblRKAFRSGKLIiQVGYTPDGKDDYRWCF 743 

705 RVEEVNWASWEQTLPTLCEDPSGA „ GVPRTLENPVIAS PPKEDEDGASEE 753 

| | : | | || . I . : I M II I M • - I I 
744 RVDEVNWTTWNTNVGI INEDPGNCEGVKRTLS FSLRSS RVSGRHWK 789 



754 NYVPVQLLQSN 

I * I II* 

7 90 NFALVPLLREASARDRQSAQPEEVYLRQFSGSLKPEDAEVFKSPAASGEK 



764 
839 
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GAP of: humanvr2.seq check: 8853 from: 1 to: 2809 
humanVR2 21611a, 2809 bases, 8853 checksum. 

to: humanvrl - seq check: 4554 from: 1 to: 3909 

humanVRl Fbhl854 7pat - Import - complete 

Symbol comparison table: 
/ddm local /gcg/gcg_9. 1 /gcgcore/data/rundata/nwsgapdna - cmp 
CompCheck: 87 60 

Gap Weight: 50 Average Match: 10.000 

Length Weight: 3 Average Mismatch: 0.000 

Quality: i4359 Length: 3934 

Ratio: 5.112 Gaps: 15 

Percent Similarity: 55.316 Percent Identity: 55.316 

Match display thresholds for the alignment (s ) : 
| = IDENTITY 
: = 5 
. * 1 



huaanvr2 . seq ac humaiwxl . seq 



1 GGCTAGCCTGTCCTGACAGGGGAGAG 26 

I I I II Hill 
801 TGTCCACAGTAGTCCCCCCTTATCCACGGGTGTCACTTTCCATGGGTTCA 850 

27 TTAAGCTCCCGTTCTCCACCGTGCCGGCTGGCCAGGTGGGCTGAGGGITGA 76 

I I I I I I MM II I I I I II 

851 GTTATTTGCGGTCAACCACGGTCTGCCAATATTAAATGGAAAATTCTTCA 900 

77 CCGAGAGACCAGAACCTGCTTGCTGGAGCTTAGTGCTCAGAGCTGGGGAG 126 

II Ml III III I M III I I II 

901 AACAGTTCCCAAGTTTTCCCTTGTGCATTGTTCTGAGCAGTGTGATGAAG 950 

_ . 

127 GGAGGTTCCGCCGCTCCTCTGCTGTCAGCGCCGGCAQCCCCTCCCGGCTT 176 

I I I II I II I I Ml I I 

951 AGTCTCTGCCGTGCCATCTGGGATGCAAACCGTCCCTGTGTCCCCCACGT 1000 

177 CACTTCCTCCCGCAGCCCCTGCTACTGAGAAGCTCCGGGATCCCAGCAGC 226 

I M Ml II I M M I Mill 

1001 CCAGGCCGTAGATGCTCCCCGCCGGTCAGTCACTTAGTCGTCAGATCGCC 1050 

227 CGCCACGCCCTGGC CTCAGCCTGCGGG 253 

'Ml I I 1 M III I I 

1051 CGTCCTGGTATCACAGTGCTTCTGTTCAGGTTGCACACTGGGCCACAGAG 1100 
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2 54 GCTCC AGTCAGGCC AACACCGACGCGC AGCTGGG AGG AAG 293 

I Mill Ml I I I I I I I M I 

1101 GATCCAGCAAGGATGAAGAAATGGAGCAGCACAGACTTGGGGACAGCTGC 1150 

294 ! . .ACAGGACCCTTGACATCTCCATCTGCACAGAGGTCCTG 331 

I Mill IN I 1 Ml Ml 

1151 GGACCCACTCCAAAAGGACACCTGCCCAGACCCCCTGGATGGAGACCCTA 1200 



332 GCTGGACCGAGCAGCCTCCTCCTCCTAGGATGACCTCACCCTCCAGC. -T 379 

HI I I I I M II Ml M Ml 

ACTCCAGGCCACCTCCAGCCAAGCCCCAGCTCCCCACGGCCAAGAGCCGC 



1201 



1250 



380 CTCCAGTTTTCAGGTTGGAGACATTAGATGGAGGCCAAGAAGATGGCTCT 4 29 

I I I I I M II II > i t M i 

1251 ACCCGGCTCTTTGGGAAGGGTGACTCGGAGGAGGCTTTCCCGGTGGATTG 1300 

4 30 GAGGCGGACAGAGGAAAGCTGGATTTTGGGAGCGGGCTGCCTCCCATGGA 4 79 
| Ml | | I I I I I I I 

1301 CCCCCACGAGGAAGGTGAGTTGGACTCCTGCCCGACCATCACAGTCAGCC 1350 

4 80 GTCACAGTTCCAGGGCGAGGACCGGAAATTCGCCCCTCAGATAAGAGTCA 529 

I | Ml I II I I Ml I ' I M I i I 

1351 CTGTTATCACCATCCAGAGGCCAGGAGACGGCCCCACCGGTGCCAGG. .C 1398 



530 ACCTCAACT ACCGAAAGGGAACAGGTGCCAGTCAGCCGGATCCAAACCGA 

M II I II I" 1,1 III 

1399 TGCTGTCCCAGGACTCTGTCGCCGCCAGCACCGAGAAGACCCTCAGGCTC 

580 TTTGACCGAGATCGGCTCTTCAATGCGGTCTCCCGGGGTGTCCCCGAGGA 
I Ml M I MM I M M lit I I 1 M II 

144 9 T ATG ATCGCAGGAGT ATCT T TGAAGCCGTTGCTCAG AAT AACTGCCAGGA 

€30 TCTGGCTGGACTTCCAGAGTACCTGAGCAAGACCAGCAAGTACCTCACCG 
HIM | n I > MM MM M Ml IMMM I 

14 99 TCTGGAGAGCCTGCTGCTCTTCCTGCAGAAGAGCAAGAAGCACCTCACAG 

680 ACTCGGAATACACAGAGGGCTCCACAGGTAAGACGTGCCTGATGAAGGCT 
,, N | | | Ml Mill IMM II IM MM II 

154 9 ACAACGAGTTCAAAGACCCTGAGACAGGGAAGACCTGTCTGCTGAAAGCC 

. .730 



579 
1448 
629 
1498 
679 
1548 
729 
1598 
779 



GTGCTGAACCTTAAGGACGGAGTCAATGCCTGCATTCTGCCACTGCTGCA 

MM Mill I HUM M II IM I I M III I 
1599 ATGCTCAACCTGCACGACGGACAGAACACCACCATCCCCCTGCTCCTGGA 1648 



GATCGACAGGGACTCTGGCAATCCTCAGCCCCTGGTAAATGCCCAGTGCA 
Mill Ml III! I II M II II IM III 



829 



760 

164 9 ^TC^GCGGCAAACGGACAGCCTGAAGGAGCTTGTCAACGCCAGCTACA 1098 



830 



879 



CAGATGACTATTACCGAGGCCACAGCGCTCTGCACATCGCCATTGAGAAG 

, 1 1 Ml Ml IMM I II IMMMMIMM MM 
1699 CGGACAGCTACTACAAGGGCCAGACAGCACTGCACATCGCCATCGAGAGA .1748 

AGGAGTCTGCAGTGTGTGAAGCTCCTGGTGGAGAATGGGGCCAATGTGCA 929 

, ■ | | | | | I M I I I I I M M M I II II IMM 

CGCAACATGGCCCTGGTGACCCTCCTGGTGGAGAACGGAGCAGACGTCCA 1798 



680 
1749 

930 



TGCCCGGGCCTGCGGCCGCTTCTTCCAGAAGGGCCAAG. . * GGACTTGCT 976 

H MM I I MUM MM I I I I II M I I 

1799 GGCTGCGGCCCATGGGGACTTCTTTAAGAAAACCAAAGGGCGGCCTGGAT 1848 
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• • • * • 

977 TTTATTTCGGTGAGCTACCCCTCTCTTTGGCCX3CTTGCACCAAGCAGTGG 1026 
I || | I I I I I I I II I I I I I II I I I I I I I II I I I II I Ml I 
184 9 TCTACTTCGGTGAACTGCCCCTGTCCCTGGCCGCGTGCACCAACCAGCTG 1898 

1027 GATGTGGTAAGCTACCTCCTGGAGAACCCACACCAGCCCGCCAGCCTGCA 1076 

I I I I I I I I I I I I I I II I I I I I I I I I II 

1899 GGCATCGTGAAGTTCCTGCTGCAGAACTCCTGGCAGACGGCCGACATCAG 194 8 

1077 GGCCACTGACTCCCAGGGCAACACAGTCCTGCATGCCCTAGTGATGATCT 1126 
I I I I I I I I I .MINIMI II I II M I I II I III I I 

194 9 CGCCAGGGACTCGGTGGGCAACACGGTGCTGCACGCCCTGGTGGAGGTGG 1998 

• 

1127 CGGACAACTCAGCTGAGAACATTGCACTGGTGACCAGCATGTATGATGGG 1176 
I II II I I I M M II I I I II I M II M II II I II I 

1999 CCGACAACACGGCCGACAACACGAAGTTTGTGACGAGCATGTACAATGAG 204 8 

1177 CTCCTCCAAGCTGGGGCCCGCCTCTGCCCTACCGTGCAGCTTGAGGACAT 1226 

I | I | | I II I M I II M II II I I II I II I 

204 9 ATTCTGATGCTGGGGGCCAAACTGCACCCGACGCTGAAGCTGGAGGAGCT 2098 

. • • * 

1227 CCGCAACCTGCAGGATCTCACGCCTCTGAAGCTGGCCGCCAAGGAGGGCA 1276 

I || II I II I I I I II I M I II II I II I Ml 
2099 CACCAACAAGAAGGGAATGACGCCGCTGGCTCTGGCAGCTGGGACCGGGA 214 8 

• • • * 

1277 AGATCGAGATTTTCAGGCACATCCTGCAGCGGGAGTT TTCAGGA 1320 

MUM I I M I II M IMMMM I II 

214 9 AGATCGGGGTCTTGGCCTATATTCTCCAGCGGGAGATCCAGGAGCCCGAG 2198 

1321 CTGAGCCACCTTTCCCGAAAGTTCACCGAGTGGTGCTATGGGCCTGTCCG 1370 

II I II I I I I I I I M I M M M M M I I I I I II I I I I I 
2199 TGCAGGCACCTGTCCAGGAAGTTCACCGAGTGGGCCTACGGGCCCGTGCA 2248 

. . • • • 

1371 GGTGTCGCTGT ATGACCTGGCTTCTGTGGACAGCTGTGAGGAGAACTCAG 14 20 

II II II Ill I I I I I I I Ml III II II I M I 

224 9 CTCCTCGCTGTACGACCTGTCCTGCATCGACACCTGCGAGAAG/^ACTCGG 2298 

1421 TGCTGGAGATCATTGCCTTTCATTGCA AGAGCCCGCACCGACACCGA 14 67 

llllllll I I I MM Ml Ml Ml I II Ml 

2299 TGCTGGAGGTGATCGCCTACAGCAGCAGCGAGACCCCTAATCGCCACGAC 2348 

1468 ATGGTCGTTTTGGAGCCCCTGAACAAACTGCTGCAGGCGAAATGGGA 1514 

Ml II I IIIIIM I I I I I I Ml I I I I I I I M Mill 

234 9 ATGCTCTTGGTGGAGCCGCTGAACCGACTCCTGCAGGACAAGTGGGACAG 2398 

. • • • 

1515 TCTGCTCATCCCCAAGTTCTTCTTAAACTTCCTGTGTAATCTGATCTACA 1564 

I Ml III II M I I I I MM I I I I I I MM 

2399 ATTCGTCAAGCGCATCTTCTACTTCAACTTCCTGGTCTACTGCCTGTACA 2448 

. . • • • 

1565- TGTTCATCTTCACCGCTGTTGCCTACCATCAGCCTACCCTGAAGAAGCAG 1614 

II I 1 I I I I I I I i I I IIIIIM 1 I Ml M 

2449 TGATCATCTTCACCATGGCTGCCTACTA. . . . CAGGCCCGTGGATGGCTT 2494 

1615 GCCGCCCCTCACCTGAAAGCGGAGGTTGGAAACTCCATGCTGCTGACGGG 1664 

Ml Ml I I M I I Mill III II I II II 
24 95 GCCTCCCTTTA . . AGATGGAAAAAATTGGAGACTATTTCCGAGTTACTGG 2542 



FIGURE 6 (cont'd) 



BNSDOCiD: < WO 0029577A 1 J_> 



WO 00/29577 



PCT/US99/26701 



17/35 

1665 CCACATCCTTATCCTGCTAGGGGGGATCTACCTCCTCGTGGGCCAGCTGT 1714 

I I II I I It 1 I f I if I I M I I I I I I II 

254 3 AGAGATCCTGtCTGTGTTAGGAGGAGTCTACTTCTTTTTCCGAGGGATTC 2592 

• • • • • 
1715 GGTACTTCTGGCGGCGCCACGTGTTCATCTGGATCTCGTTCAT AGACAGC 1764 

I I I I I I I I 1 I I II II III III I MUM 

2593 AGTATTTCCTGCAGAGGCGGCCGTCGATGAAGACCCTGTTTGTGGACAGC 264 2 

• • * * * 
1765 TACTTTGAAATCCTCTTCCTGTTCCAGGCCCTGCTCACAGTGGTGTCCCA 1814 

I J I III II I I., f. I I I I f I I I I I I I II IN i I 
264 3 TACAGTGAGATGCTTTTCTTTCTGCAGTCACTGTTCATGCTGGCCACCGT 2 692 

1815 GGTGCTGTGTTTCCTGGCCATCGAGTGGTACCTGCCCCTGCTTGTGTCTG 1864 
I I I I I I II III f II II 111 I I I I I I I 

2693 GGTGCTGTACTTCAGCCACCTCAAGGAGTATGTGGCTTCCATGGTATTCT 274 2 

18 65 CGCTGGTGCTGGGCTGGCTGAACCTGCTTTACTATACACGTGGCTTCCAG 1914 

I I I I I I I I I I I I I I 11 I 1 I I II 11 I II M M MINI 
274 3 CCCTGGCCTTGGGCTGGACCAACATGCTCTACTACACCCGCGGTTTCCAG 27 92 

• • • • • 
1915 CACACAGGCATCTACAGTGTCATGATCCAGAAGGTCATCCTGCGGGACCT 1964 

III I 11 I I I I I 1 I I I I I I I I I I I I I I 1 I I I I I 1 I I I I 

2793 CAGATGGGCATCTATGCCGTCATGATAGAGAAGATGATCCTGAGAGACCT 2842 

• . • • ♦ 
1965 GCTGCGCTTCCTTCTGATCTACTTAGTCTTCCTTTTCGGCTTCGCTGTAG 2014 

I II III I I I II M I I I I I I I I I I 1 I I I I I II 

284 3 GTGCCGTTTCATGTTTGTCTACATCGTCTTCTTGTTCGGGTTTTCCACAG 2892 

• • • • • 
2015 CCCTGGTGAGCCTGAGCCAGGAGGCTTGGCGCCCCGAAGCTCCT ACAGGC 2064 

I I I I I 1 I I I I I I II I I I I I I I 

2893 CGGTGGTGACGCTGATTGAAGACGGGAAGAATGACTCCCTGCCGTCTGAG 2942 

. • * • - 

2065 CCCAATGCCACAGAGTCAGTGCAGCCCATGGAGGGACAGGAGG ACGAGGG 2114 

I II I III 1 I I I I I II 

294 3 TCCA CGTCGCACAGGTGGCGGGGGCCTGCCTGCAGGCC 2980 

• • • • • 
2115 CAACGGGGCCCAGTACAGGGGTATCCTGGAAGCCTCCTTGGAGCTCTTCA 2164 

I II I Nil I I I I I I II I INI I IN 

2981 CCCCGATAGCTCCTACAACAGCCTGTACTCCACCTGCCTGGAGCTGTTCA 3030 

• • • • • 
2165 AATTCACCATCGGCATGGGCGAGCTGGCCTTCCAGGAGCAGCTGCACTTC 2214 

I I M I II II I N II I II I I! I II I I III III I INN 
3031 AGTTCACCATCGGCATGGGCGACCTGGAGTTCACTGAGAACTATGACTTC 3080 

• • * * • 
2215 CGCGGCATGGTGCTGCTGCTGCTGCTGGCCTACGTGCTGCTCACCTACAT 2264 

I I I I I I I II I I II I I I I I I N I I I I I I I I I I I I 
3081 AAGGCTGTCTTCATCATCCTGCTGCTGGCCTATGTAATTCTCACCTACAT 3130 

• • • • • 
2265- CCTGCTGCTCAACATGCTCATCGCCCTCATGAGCGAGACCGTCAACAGTG 2314 

III I I 11 I N II I II N N II N II II N I I I I I I I Mlllll 
3131 CCTCCTGCTCAACATGCTCATCGCCCTCATGGGTGAGACTGTCAACAAGA 3180 

• • • * • 
2315 TCGCCACTGACAGCTGGAGCATCTGGAAGCTGCAGAAAGCCATCTCTGTC 2364 

I I I I II III II I I I I I I I I I I I I I I I I I I I I I I I I I II 

3181 TCGCACAGGAGAGCAAGAACATCTGGAAGCTGCAGAGAGCCATCACCATC 3230 
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2365 CTGGAGATGGAGAATGGCTATTGGTGGTGCAGGAAGAAG . . . CAGCGGGC 2411 

Mill I MM I I J I I M I II I II I 

3231 CTGGACACGGAGAAGAGCTTCCTTAAGTGCATGAGGAAGGCCTTCCGCTC 3280 

2412 AGGTGTGATGCTGACCGTTGGCACTAAGCCAGATGGCAGCCCGGATGAGC 24 61 

Ml 1 I I I I I I I I I I f I I I I I I I I II II 

3281 AGGCAAGCTGCTGCAGGTGGGGTACACACCTGATGGCAAGGACGACTACC 3330 

24 62 GCTGGTGCTTCAGGGTGGAGGAGGTGAACTGGGCTTCATGGGAGCAGACG 2511 

I I I I I M I I I I I I J I I I I I I I I I I I I I I I I I I Ml I t 

3331 GGTGGTGCTTCAGGGtGGACGAGGTGAACTGGACCACCTGGAACACCAAC 3380 

2512 CTGCCTACGCTGTGTGAGGACCCG . . . TCAGGGGCAGGTGTCCCTCGAAC 2558 

M I I I I I I I I M II N Ml I I 11 

3381 GTGGGCATCATCAACGAAGACCCGGGCAACTGTGAGGGCGTCAAGCGCAC 34 30 

2 559 TCTCGAGAACCCTGTCCTG GCTTCCCCTCCCAAGGAGGATGAGGAT 2604 

M I I I I I M M N I I 

34 31 CCTGAGCTTCTCCCTGCGGTCAAGCAGAGTTTCAGGCAGACACTGGAAGA 34 80 

2605 GGTGCCTCTGAGGAAAACTATGTGCCCGTCCAGCTCCTCCAGTCCAACTG 2654 

| | I III II I Ml II II 

34 81 ACTTTGCCCTGGTCCCCCTTTTAAGAGAGGCAAGTGCTCGAGATAGGCAG 3530 

• • * * 

2 655 ATGGCCCAGATGCAGCAGGAGGCCAGAGGACAGAGCAGAGGATCTTTCCA 2704 

U Ml II I I . Mill III III I I 

3531 TCTGCTCAGCCCGAGGAAGl'TTATCTGCGACAGTTTTCAGGGTCTCTGAA 3580 

2705 ACCACATCTGCTGGCTCTGGGGTCCCAGTGAATTCTGGTGGCAAATATAT 2754 

| M I I I M I II I I M I III I 

3581 GCCA GAGGACGCTGAGGTCTTCAAGAGTCCTGCCGCTTCCGGGGA 3625 

2755 ATTTTCACTAACTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 2804 

II I Ml I I I I Ml III 

3626 GAAGTGAGGACGTCACGCAGACAGCACTGTCAACACTGGGCCTTAGGAGA 3675 



2805 AAAAA 

3676 CCCCGTTGCCACGGGGGGCTGCTGAGGGAACACCAGTGCTCTGTCAGCAG 



2809 
3725 
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CLUSTAl* W (1.74) multiple sequence alignment 



human VR2 
ratVR2 



MTSPSSSPVFRLETLJX5GOEDGSEADRGKLDFGSGLPPMESOFOGEDRKFAP0IRVKLNY 



human VR2 
ratVR2 



RKGTGASQPDPNRFDRDRLFNAVSRGVPEDIAGLPEY^ 



humanVR2 
ratVR2 



MKAVLNLKDG VNAC I I*PLIiQIDRDSGNPQPLVHAQCTDD YYRGHSAJLH I AIEKRSLQCVK 



human VR2 
ratVR2 



LL VENGANVHARAOGR FFQ KGQGTC F Y FG ELP L»S LAACTKQWD WS YLLENPHQ P AS UQA 
S THAS ALSLAACTKQ WD VVT YI*LENPHQP ASI*EA 



humanVR2 
ratVR2 



TDSQGNTVLHALiVM I SDNS AEN I ALVTSHYIXSLLQAGARLiCPTVQLiED I RNLiQDLTPLKL 
TDS LGNTVLHALVM I ADNS PENS ALV I HhmXSLLQMGARLCPTVQLEE I SNHQGL/T PLKL 



human VR2 
ratVR2 



AAKEGKIEIFRHIlXJI^FSG-LSHLSRKFTEWCyGPVRVSLYDIJ^VDSCEENSVI^IIA 
AAKEGKI E I FRHI UJREFSGP YQPI-S RKFTEWCYG PVRVSLYDI^SS VDS WEKNS VLEI IA 



human VR2 
ratVR2 



human VR2 
ratVR2 



FHCKSPHRJIRhTVAfl^PLNKLiIX^AKWDI^ 

FHCKS PNRHRMWLE PLNKLLQEKTORLVSRPFFNPACYXiVYMFI FTWAYHQPSLDQPA 
A********************* *** *.,.**.** * ******** .**«***•* . * 

APKLKAEVGNSMI^TGHILII^IXXSI 
IPSSKATFGESMLLIiGHILIIjySGIYIJ^IiTO 



human VR2 
ratVR2 



I*TWSQVLCFIAIEWYIiPI*LVSAI^^ 

******** * . ******** .**************************+*********« 



human VR2 
ratVR2 



humanVR2 
ratVR2 



LIYLiVFLFG FAVALVSI^QETVWRPEAPTGPNATES VOPMK<^mT^f^(^iVQVT?f; tt .r? a cy.t? 
LVYLVFLFG FAVAliVS I*S REARS PKAPEONNSTVTEQPTVGQEEEP APYRSILDASIjE 

A;****************;** *j#* . *;* - *• ***** * *«.**.**«* 

lirisjrA'KMGKIAFQBQLHFRGMVTJ JiTiT ^^Vl^TYTl^lJMfi^lKUt^ETVKSVJ^rDS^X 
IiFkjrrlGMaKIAgQHQI^FRgWr J J iTtTAYVIiI^YVIJJJnfr.TflXiM^ 



humanVR2 
ratVR2 



KKIiQKAICVI^MENGYHWCR- KKQRAOVMMVOTKPDGS PDERKCFKVKEVNKAS WBQTIj 
WlQ^KAIgVIjEMENaYMWCRRiaCH*^^ 



humanVR2 
ratVR2 



PTLCSDPSGAGVPRTIjQTPVZjAS PPKEDEDGASESNYVPVQIAjQSN 
PXl«SRDPSGPGXTCNKKNPT8K-POR — -KSASEEDHLPLQVLQSP 
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GAP of: ratvr2.pep check: 9190 from: 1 to: S54 
ratVR2 Flrxbl47gll 

to: humanvr2.pep check: S746 from: 1 to: 764 
humanVR2 Plh21ell 

Symbol comparison table: /usr/local/gcg_9 . i/gcgcore/data/rundata/blosum62 . cmp 
CompCheck: 6430 

Gap Height: 12 Average Match: 2.912 

Length Height: 4 Average Mismatch: -2.003 

Quality: 2182 Length: 766 

Ratio: 3.939 Ga P 8 : J 

Percent Similarity: 81.703 Percent Identity: 79.167 

Match display thresholds for the alignment (s) : 
| o IDENTITY 

: = 2 
. = 1 

ratvr2.pep x humanvr2 .pep 



.STHASAI^I^CTlbwDvVTY^ 



44 



iiiiiiiiiiiii-iiiiiiiiiiihini nun 

45 ALVMIAmSPENSALV^ 94 

1 1 1 1 1 III II III II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 I 1 111111 „ 



301 



.45 B*^^^™??*™^ 



194 
399 



19S 
400 



294 
499 



245 wraPm^^ 

4S0 ililii!^ 

,7, 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 * 1 1 1 II 1 1 1 

,45 »^««^^ 

S50 iUlUU^ - 
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393 «™««"™^^ 4 " 

600 LFKFTIGMGELAFQEQUHFRGMVLLLLlJVYV^i 

«43 ^nswsikki^isv^ 492 

650 IJl^lMUIlin^^ oe 
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GAP of: humanvrl.seq check: 4554 from: 1 to: 3909 
humanVRl Fbhl854 7pat - Import - complete 

to: ratvrl.seq check: 7921 from: 1 to: 2847 

ratVRl.seq AF029310 in GenBank 

Symbol comparison table: 
/ddnriocal/gcg/gcg_9.1/gcgcore/data/rundata/nw S gapdna.c m p 

CompCheck: 8760 

Gap Weight: 50 Average Match: 10.000 

Length Weight: 3 Average Mismatch: 0.000 

Quality: 22717 Length: 3914 

Ratio: 7.979 Gaps: 10 

Percent Similarity: 82.125 Percent Identity: 82.125 

Match display thresholds for the alignment (s) : 

| = IDENTITY 

: - 5 

. = 1 

humanvrl.seq x rafcvxl . seq . 



1001 CCAGGCCGTAGATGCTCCCCGCCGGTCAGTCACTTAGTCGTCAGATCGCC 1050 

1 _ CAGCTCCAAGGCACTTGCTCC 21 

1051 CGTCCTGGTA.TCACAGTGCTTCTGTTCAGGTTGCACACTGGGCCACAGAG 1100 
■ I I I I II II | | I I I I I I I 1 I I I II I I I I M 

22 ATTTGGGGTGTGCCTGCACCT . - - AGCTGGTTGCAAATTGGGCCACAGAG 68 

GAAGAAATGGAGCAGCACAGACTTGGGGACAGCTGC 1150 

I I I i i I 1 I | I | III I I I I I II III 

1 1 1 1 ' 'ill"' '.^.^^^/-Tivr^TTAKACTCAGAGGAGTCTGA 118 



1101 GATCCAGCAAGGATGAAGAAATGGAfcU/*^^^ * x — , , 7 

l I I I I I I I I I I I I II II Ml I I I I I II 111 

69 GATCTGGAAAGGATGGAACAACGGGCTAGCTTAGACTCAGAGGAGTCTGA 

1151 GGACCCACTCCAAAAGGA^ 1200 
I Mill MM M M_MM M„J> J "^^ 
119 GTCCCCACCO 



:CAAGAGAACTCCTGCCTGGACCCTCCAGACAGAGACCCTA 168 



1201 ACTCCAGGCCACCTCCAGCCAA^^ 1250 
mi ii i I I I I I ! I | I I Mill II HI 111 111 11 

169 ACTGCAAGCCA^ 



218 



TGGGAAGGGTGACTCGGAGGAGGCTTTCCCGGTGGATTG 1300 

268 



1251 ACCCGGCTCTTTGGGAAGGGTUA^n-trt^^v^^- * 7 7 

I I I I 1111 II II I I M II I I I M I I M M M I I I I M I INI 
219, ACOC^GCOTCTTGGGAAGGGTGACTCGGAGGIVGGOCTCTCOCCTGGACTG 

1301 CCCCCACGAGGAAGGTGAGTTGGACTCCTGCCCGAOCATC 

in i M m ii ii i _i _LL1 LJULiJLJUULi JiHHJHi^, 

269 CCCTTATGAGGAA 
1351 CTGTTATCACCAT 

II II I M I I I 

CCAGAGGCCTG 
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CCCTTATGAGGAAGGCGG^CTGGCTTCCTGiCCCTATCATCACTGTCAGCT 318 

™ CCAGAGGCCAGGAGACGGCCCCACCGGTGCCAGGCTG 1400 

i i i i I I II I I II II I II M II M M II M Ml Mill I 
319 CTGTTCTAACTATCCAGAGGCCTGGGGATGGACCTGCCAGTGTCAGGCCG 368 
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1450 



1401 CTGTCCCAGGACTCTGTCGCCGCCAGCACCGAGAAGACCCTCAGGCTCTA 
I I I I I I I I I I I It I I I I I I I I I I I I IK I I I I I I I I 

369 TCATCCCAGGACTCCGTCTCCGCTGG . • • TGAGAAGCCCCCGAGGCTCTA 415 



14 51 TGATCGCAGGAGTATCTTTGAAGCCGTTGCTCAGAATAACTGCCAGGATC 

| | | | I I I I I I I I I II M IIMIII llllllllllll I 

416 TGATCGCAGGAGCATCTTCGATGCTGTGGCTCAGAGTAACTGCCAGGAGC 

1501 TGGAGAGCCTGCTGCTCTTCCTGCAGAAGAGCAAGAAGCACCTCACAGAC 

| | | I | | | | M | I M I I I I I I I I M I I I M M I I I I I I MI II HI 

4 66 TGGAGAGCCTGCTGCCCTTCCTGCAGAGGAGCAAGAAGCGCCTGACTGAC 



1551 
516 
1601 



AACGAGTTCAAAGACCCTGAGACAGGGAAGACCTGTCTGCTGAAAGCCAT 

I llllllllllll IN III Mill MM MMI I MM II MUM 

AGCGAGTTCAAAGACCCAGAGACAGGAAAGACCTGTCTGCTAAAAGCCAT 



GCTCAACCTGCACGACGGACAGAACACCACCATCCCCCTGCTCCTGGAGA 

| | | I It I I M M I II MM! I II M I I I II II I I I I I I I 

566 GCTCAATCTGCACAATGGGCAGAATGACACCATCGCTCTGCTCCTGGACG 



1651 TCGCGCGGCAAACGGACAGCCTGAAGGAGCTTGTCAACGCCAGCTACACG 

| || | | | | || I II II M M M I M M I M II M M II I M II 

616 TTGCCCGGAAGACAGACAGCCTGAAGCAGTTTGTCAATGCCAGCTACACA 

1701 GACAGCTACTACAAGGGCCAGACAGCACTGCACATCGCCATCGAGAGACG 

M M II II M II II I II M II I M II I I M I M II II I M II I I I 

666 GACAGCTACTACAAGGGCCAGACAGCACTGCACATTGCCATTGAACGGCG 



1500 
465 
1550 
515 
1600 
565 
1650 
615 
1700 
665 
1750 
715 
1800 



1751 CAACATGGCCCTGGTGACCCTCCTGGTGGAGAACGGAGCAGACGTCCAGG 
I I I I I I | | M | ! I I I I I I I I M I II I I M I I II II II IIMIII 
716 GAACATGACGCTGGTGACCCTCTTGGTGGAGAATGGAGCAGATGTCCAGG 765 

■ 

1801 CTGCGGCCCATGGGGACTTCTTTAAGAAAACCAAAGGGCGGCCTGGATTC 

mini i 1 1 1 1 1 1 1 1 i 1 1 in him mini m 

766 CTGCGGCTAACGGGGACTTCTTCAAGAAAACCAAAGGGAGGCCTGGCTTC 
1851 TACTTCGGTGAACTGCCCckTCCCTGGCCGCGTGC^CCAACCAGCTGGG 

inn inn 1 1 iii ii m mini immmiimiin 

816 TACTTTGGTGAGCTGCCCCTGTCCCTGGCTGCGTGCACCAACCAGCTGGC 

1901 CATCGTGAAGTTCCTGCTGCAGAACTCCTGGCAGACGGCCGACATCAGCG 1950 

Mill IIIIM NIMH I II llllllllllll I H I II IIMIII 

CATTGTGAAGTTCCTGCTGCAGAACTCCTGGCAGCCTGCAGACATCAGCG 



1850 
815 
1900 
865 



866 

1951 CCAGGGACTCGGTG^^ 

I | m I I I 1 1 I I III III III I I I III II I III I I III II I III I I 

91 6 CCCGGGACTCAGTGGGCAACACGGTGCTTCATGCCCTGGTGGAGGTGGCA 

2001 GACAACACGGCCGACAACACGAAGTTTGTGACGAGCATCT 

- ii Mm i 1 1 1 m 1 1 1 1 m mil i ii in 1 1 iii i iim 

966 GATAACACAGTTGACAACACCAAGTTCGTGACAAGCATGTACAACGAGAT 

• • • 

2051 TCTGATGCTGGGGGCCAAACTGCACCCGACGCTGAAGCTGGAGGAGCTCA 

2051 mniMimiiiiii imm immmimimi mi mi 

1016 CTTGATCCTGGGGGCCAAACTCCACCCCACGCTGAAGCTGGAAGAGATCA 



915 
2000 
965 
2050 
1015 
2100 
1065 
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2101 CCAACAAGAAGGGAATGACGCCGCTGGCTCTGGCAGCTGGGACCGGGAAG 2150 
| I I I I I MINI I I I I II I I II I I I I I I I HI I I I I I I I I 

1066 CCAACAGGAAGGGGCTCACGCCACTGGCTCTGGCTGCTAGCAGTGGGAAG 1115 

2151 ATCGGGGTCTTGGCCTATATTCTCCAGCGGGAGATCCAGGAGCCCGAGTG 2200 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
1116 ATCGGGGTCTTGGCCTACATTCTCCAGAGGGAGATCCATGAACCCGAGTG 1165 



2201 



2250 



CAGGCACCTGTCCAGGAAGTTCACCGAGTGGGCCTACGGGCCCGTGCACT 

| | | | II I I I I i I i I I I I I I I 1 I I I I I I I I I I I I I i Ml I 

1166 CCGACACCTATCCAGGAAGTTCACCGAATGGGCCTATGGGCCAGTGCACT 1215 



2300 



2251 CCTCGCTGTACGACCTGTCCTGCATCGACACCTGCGAGAAGAACTCGGTG 

| | | | || M I I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I 
1216 CCTCCCTTTATGACCTGTCCTGCATTGACACCTGTGAAAAGAACTCGGTT 1265 



2301 CTGG AGGTGATCGCCT ACAGCAGCAGCGAGACCCCTAATCGCCACG ACAT 

| | | | | I I I I I I I I I I I I I I I I I III I I I I I I I I I I I II II I I I I I 
1266 CTGGAGGTGATCGCTTACAGCAGCAGTGAGACCCCTAACCGTCATGACAT 

2351 GCTCTTGGTGGAGCCGCTGAACCGACTCCTGCAGGACAAGTGGGACAGAT 

in i mil ii 1 1 1 1 1 1 ii ii 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii 1 1 1 1 

1316 GCTTCTCGTGGAACCCTTGAACCGACTCCTACAGGACAAGTGGGACAGAT 

24 01 TCGTCAAGCGCATCTTCTACTTCAACTTCCTGGTCTACTGCCTGTACATG 

| | | I | | J I I I I I I 1 I I 1 1 • I I I • I I t I I I I II II Nil Mil III 
1366 TTGTCAAGCGCATCTTCTACTTCAACTTCTTCGTCTACTGCTTGTATATG 

. • * 

2451 ATCATCTTCACCATGGCTGCCTACTACAGGCCCGTGGATGGCTTGCCTCC 

I 1 I f I t I 1 I I I I I I I I I I 1 I I 1 1 1 MM Mill Mill 

1416 ATCATCTTCACCGCGGCTGCCTACTATCGGCCTGTGGAAGGCTTGCCCCC 

2501 CTTTAAGATGGAAAAAA. . 1 TTGGAGACTATTTCCGAGTTACTGGAGAGA 2547 

II 1 1 | | || MM I MM IM1MMMIMI I M 

CTATAAGCTGAAAAACACCGTTGGGGACTATTTCCGAGTCACCGGAGAGA 



2350 
1315 
2400 
1365 
2450 
1415 
2500 
1465 



1466 

* 

2548 TCCTGTCTGTGTTAGGAGGAGTCTACTTCTTTTTCCGAGGGATTCAGTAT 

H Ml III III III III MM M MM MUM II II Ml 

1516 TCTTGTCTGTGTCAGGAGGAGTCTACTTCTTCTTCCGAGGGATTCAATAT 

2598 TTCCTGCAGAGGCGGCCGTCGATGAAGACCCTGTTTGTGGACAGCTACAG 

IMIIII M II I MM IIMIMIIMMI HIM 

1566 TTCCTGCAGAGGCGACCATCCCTCAAGAGTTTGTTTGTGGACAGCTACAG 

2648 TGAGATGCTTTTCTTTCTGCAGTCACTGTTCATGCTGGCCACCGTGGTGC 
IMIII | I | | | 1 1 | | I IMM MM II Ml II II I Mill I 
1616 TGAGATACTTTTCTTTGTACAGTCGCTGTTCATGCTGGTGTCTGTGGTAC 



1515 
2597 
1565 
2647 
1615 
2697 
1665 



2698 TGTACTTCAGCCACCTCAAGGAGTATGTGGCTTCCATGGTATTCTCCCTG 2747 

I 1 1 1 1 1 | I I 1 1 1 1 I I I I I I I I I I I I 1 1 1 I I Ml MMMIM 

1666 TGTACTTCAGCCAACGCAAGGAGTATGTGGCTTCCATGGTGTTCTCCCTG 1715 

2748 GCCTTGGGCTGGACCAACATGCTCTACTACACCCGCGGTTTCCAGCAGAT 2797 

I HI Mill II MM II Mill 

1716 GCCATGGGCTGGACCAACATGCTCTACTATACCCGAGGATTCCAGCAGAT 1765 
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27 98 GGGCATCTATGCCGTCATGATAGAGAAGATGATCCTGAGAGACCTGTGCC 284 7 

I | I II II I I 1 I I I I I I I I I I I I I ! I I I M M I I I I I I I I I I I I I I I I 

17 66 GGGCATCTATGCTGTCATGATTGAGAAGATGATCCTCAGAGACCTGTGCC 1815 

284 8 GTTTCATGTTTGTCTACATCGTCTTCTTGTTCGGGTTTTCCACAGCGGTG 2897 

I || I I | I I I I I I I I MM M II II M II I M M M I II I I M 
1816 GGTTTATGTTCGTCTACCTCGTGTTCTTGTTTGGATTTTCCACAGCTGTG 1865 

28 98 GTGACGCTGATTGAAGACGGGAAGAATGACTCCCTGCCGTCTGAGTCCAC 2947 

Mill IMIIMI II Mill MM MM Mill IMIIMI 

18 66 GTGACACTGATTGAGGATGGGAAGAATAACTCTCTGCCTATGGAGTCCAC 1915 

* * * * 

2 94 8 GTCGCACAGGTGGCGGGGGCCTGCCTGCAGGCCCCCCGATAGCTCCTACA 2997 

I | I I I III I I I I I I I I I I I I I I I I M I II I I I I I I I 

1916 ACCACACAAGTGCCGGGGGTCTGCCTGCAAG - - - CCAGGTAACTCTTACA 1962 

2 998 ACAGCCTGTACTCCACCTGCCTGGAGCTGTTCAAGTTCACCATCGGCATG 304 7 

I | I I | | M II Mill M II II I I II M II M M II II M M M M It 
1963 ACAGCCTGTATTCCACATGTCTGGAGCTGTTCAAGTTCACCATCGGCATG 2012 

304 8 GGCGACCTGGAGTTCACTGAGAACTATGACTTCAAGGCTGTCTTCATCAT 3097 

| M II I II I I I M II II I II If It II I I I I M I I M ! I M 1! I I I I I I I 
2013 GGCGACCTGGAGTTCACTGAGAACTACGACTTCAAGGCTGTCTTCATCAT 2062 

3098 CCTGCTGCTGGCCTATGTAATTCTCACCTACATCCTCCTGCTCAACATGC 3147 

lilt | || M M I I M I M I I I I II I M 1 I I I II II II I I I II II I I 
2063 CCTGTTACTGGCCTATGTGATTCTCACCTACATCCTTCTGCTCAACATGC 2112 

3148 TCATCGCCCTCATGGGTGAGACTGTCAACAAGATCGCACAGGAGAGCAAG 3197 

Mil M MIIMMMIMI M II M M II I Mill IMIIMM 
2113 TCATTGCTCTCATGGGTGAGACCGTCAAC71AGATTGCACAAGAGAGCAAG 



2162 



3198 AACATCTGGAAGCTGCAGAGAGCCATCACCATCCTGGACACGGAGAAGAG 3247 
| || II M M I I M M I M I I I I I M I I M II I II M I I II IMIIMI 
AACATCTGGAAGCTGCAGAGAGCCATCACCATCCTGGATACAGAGAAGAG 



2163 



2212 



324 8 CTTCCTTAAGTGCATGAGGAAGGCCTTCCGCTCAGGCAAGCTGCtGCAGG 3297 

IMIM MM MM MMM IMIIMM Ml MIIMMMIMI M 
2213 CTTCCTGAAGTGCATGAGGAAGGCCTTCCGCTCTGGCAAGCTGCTGCAGG 2262 

3298 TGGGGTACACACCTGATGGCAAGGACGACTACCGGTGGTGCTTCAGGGTG 3347 

MMM Ml IMM MMMM I I I I I M 1 I I I II I IMIIMM 
2263 TGGGGTTCACTCCTGACGGCAAGGATGACTACCGGTGGTGTTTCAGGGTG 



2312 
3397 
2362 



3348 GACGAGGTGAACTGGACCACCTGGAACACCAACGTGGGCATCATCAACGA 
IMIIMI MMMM I MMM Ml MM Mill I II Ml II III 
2313 GACGAGGTAAACTGGACTACCTGGAACACCAATGTGGGTATCATCAACGA 

3398 AGACCCGGGCAACTGTGAGGGCGTCAAGCGCACCCTGAGCTTCTCCCTGC 3447 

MM! M M II M II I M M II M M I I I M I I I M M II M M M I 
2363 GGACCCAGGCAACTGTGAGGGCGTCAAGCGCACCCTGAGCTTCTCCCTGA 2412 

34 4 8 GGTCAAGCAGAGTTTCAGGCAGACACTGGAAGAACTTTGCCCTGGTCCCC 34 97 
(Ml IMMM Ml I M I II M I I I M I I I I I I I If III 

2413 GGTCAGGCCGAGTTTCAGGGAGAAACTGGAAGAACTTTGCCCTGGTTCCC 24 62 



34 98 CTTTTAAGAGAGGCAAGTGCTCGAGATAGGCAGTCTGCTCAGCCCGAGGA 
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I M I I I I I I I I I I I I I I I I I I I I I I 1 ' I''' HI' 
CTTCTGAGGGATGCAAGCACTCGAGATAGACATGCCACCCAGCAGGAAGA 



2512 
3597 
2562 



354 8 AGTTTATCTGCGACAGTTTTCAGGGTCTCTGAAGCCAGAGGACGCTGAGG 

1,11 | hi || I I I II II II I I I I I I I I I I I III'''' 
2513 AGTTCAACTGAAGCATTATACGGGATCCCTTAAGCCAGAGGATGCTGAGG 

3598 TCTTCAAGAGTCCTGCCGCTTCCGGGGAGAAGTGA . GGACGTCACGCAGA 3646 

,l||||| || | | I I I I I I I I I I I I I I • I ' 1 • 

TTTTCAAGGATTCCATGGTCCCAGGGGAGAAATAATGGACACTATGCAGG 



2563 



2612 
3696 



364 7 CAGCACTGTCAACACTGGGCCTTAGGAGACCCCGTTGCCACGGGGGGCTG 

ill 1 1 : i 1 1 1 

2613 GATCAATG CGGGGTCTTTGGGTGGTCTG 2640 

3697 CTGAGGGAACACCAGTGCTCTGTCAGCAGCCTGGCCTGGTCTGTGCCTGC 3746 

i I I I I I I I I I I III II 111 I I I I I I I I I I I 

2641 CTTAGGGAAC.CAGCAGGGTTGACGTTATCTGGGTCCACTCTGTGCCTGC 2689 

3747 CCA.GCATGTTCCCAAATCTGTGCTGGACAAGCTGTGGgAaGCGTTCTTG 3795 

■ I ill I I I I I II I Ml I I I I I I I I I I I I 
2690 CTAGGCACATTCCTAGGACTTCGGCGGGCCTGCTGTGGGAA . CTGGGAGG 2738 

3796 GAAGCATGGGGAGTGATGTACATCCAACCGTCACTGTCCCCAAGTGAAT 

I I I I I I I I I I I I I I I I I I I I I I I I I 

2739 TGTGTGGGAATTGAGATGTGTATCCAACCATGA. . . TCTCCAAACATTT — 

TCCTAACAGACTTTCAGGTTTTTACTCACTTTACTAAAAAAAAAAAAAAA 3895 

ii i 11 t I II I 1 | | | I I I M Ml 

GCTTTCAACTCTTTATGGACTTTATTT^AACAGAGTGAATGGCAAATCTCT 2835 



3845 



3846 
2786 



38 96 AGGGCGGCCGCTTA 3909 

I III 

2836 ACTTGGACACAT . . 2847 
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GAP of: humanvrl .pep check: 6877 from: 1 to: 839 
humanVRl _Fbhl8547pat - fchrb87a6 / 3909 bases, 4554 checksum, 
to: ratvrl.pep check: 5764 from: 1 to: 838 

ratVRl | AF02931O Rattus norvegicus vanilloid receptor subtype 1 mRNA, 
complete 
cds . 

Symbol comparison table: 
/ddm_local/gcg/gcg_9 . l/acgcore/data/rundata/blosum62 - cmp 
CompCheck: 64 30 

Gap Weight: 12 Average Match: 2.912 

Length Weight: 4 Average Mismatch: -2.003 

Quality: 3734 Length: 840 

Ratio: 4.456 Gaps: 3 

Percent Similarity: 89.247 Percent Identity: 86.022 

Match display thresholds for the alignment <s ) : 
I = IDENTITY 
: - 2 
. = 1 

human vrl . pep x ratvzrl - pep 

1 MKKWSSTDLGTAADPLQKDTCPDPLDGDPNSRPPPAKPQLPTAKSRTRLF 50 

I ... I I . I I ... I II I III : It I II : I : 11 I I t I 
1 MEQRASLDSEESESPPQENSCLDPPDRDPNCKPPPVKPHIFTTRSRTRLF 50 

51 GKGDSEEAFPVDCPHEEGELDSCPTITVSPVITIQRPGDGPTGARLLSQD 100 

I I M I I I I I . I I I : M I I III If I I I : M I I I I I I I I III 
51 GKGDSEEASPLDCPYEEGGLASCPIITVSSVLTIQRPGDGPASVRPSSQD 100 

101 SVAASTEKTLRLYDRRSIFEAVAQNNCQDLESLLLFLQKSKKHLTDNEFK 150 

( I . I II I I I I I I I I I : I I I I . M I : I i I I I 111:111 I I I . I I I 
101 SVSAG.EKPPRLYDRRSIFDAVAQSNCQELESLLPFLQRSKKRLTDSEFK 14 9 

151 D PETGKTCLLKAMLNLHDGQNTT I PLLLE I ARQT DSLKELVNAS YTDS Y Y 200 

IMIIM Ml I I IIIII.M I II ||I::M.IMII: MINIMI! 
150 DPETGKTCLLKAMLNLHNGQNDTIALLLDVARKTDSLKQFVN71SYTDSYY 199 

• • • * • 

201 KGQTALH I AIERRNMALVTLLVENGADVQAAAHGDFFKKTKGRPG FYFGE 250 

I II I II M I M II M I I M I M M I M I M I . I M I M I I I I I I I I I I I 
200 KGQTALH IAIERRNMTLVTLLVENGADVQAAANGDFFKKTKGRPGFYFGE 24 9 

251 LPLSLAACTNQLGIVKFLLQNSWQTADISARDSVGNTVLHALVEVADNTA 300 

M I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
250 LPLSI-AACTNQLAIVKFLLQNSWQPADISARDSVGNTVLHALVEVADNTV 299 

• • • • . 

301 DNTKFVTShlY^EILMLGAKLHPTLKLEELTNKKGMTPLJVIAAGTGKIGVL 350 

I M I M I I M I M I . I I I I I M M M M : M : M : II I J M I .111111 
300 DNTKFVTSMYT^EILIIjGAKLHPTLKLEEITNRKGLTPIaALAASSGKIGVL 349 

351 AYILQREIQEPECRHLSRKFTEWAYGPVHSSLYDLSCIDTCEKNSVLEVI 4 00 

M I I I I I I I II M I I I M I I I I I I I I I I I I I I I I I I I I I II I I I I I I M 
350 AYILQREIHEPECRHLSRKFTEWAYGPVHSSLYDLSCIDTCEKNSVLEVI 399 
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401 AYSSSETPNRHDMLLVEPLNRLLQDKWDRFVKRIFYFNFLVYCLYMIIFT 450 

I II I I I I I I II I I I I I I I I I M I I I I I I II I I I I M I I I I I I I iiim 
4 00 A YS SS ET PNRHDMLLVE PLNRLiLQDKWDRFVKRI FY FN FFVYCL YM 1 1 FT 44 9 

451 MAAYYRPVDGLPPFKMEK. IGDYFRVTGEILSVLGGVYFFFRGIQYFLQR 499 

MIIIM:IMI:l:- =11111111111 I I I I I I I I I I I I I 

4 50 AAAY YRPVEGLP P YKLKNT VGDY FRVTGEI LS VSGGVYFFFRG I QY FLQR 4 99 

500 RPSMKTLFVDSYSEMLFFLQSLFMLATVVLYFSHLKEYVASMVFSUVLGW 54 9 

| | | : | . | I I I I I I I • H I - I I I M I -Mllll Ml Ml I 

500 RPSLKSLFVDSYSEILFFVQSLFMLVSVVLYFSQRKEYVASMVFSLAMGW 54 9 

a • 

550 TNMLYYTRGFQQMGIYAVMIEKMILRDLCRFMFVYXVFLFGFSTAVVTLI 599 

IIIIIMIIMMIIMMIIIMMIMMIIIMMMMI M 

550 TNMLYYTRGFQQMGI YAVMIEKMILRDLCRFMFVYLVFLFGFSTAVVTLI 599 

600 EDGKNDSLPSESTSHRWRGPACRPPDSSYNSLYSTCLELFKFTIGMGDLE 64 9 

II Ml | : || ||: I . I I I I I I I M I I I I I I I I I I I I I I 

600 EDGKNNSLPMESTPHKCRGSACK. PGNSYNSLYSTCLELFKFTIGMGDLE 648 

650 FTENYDFKAVFI I LLLAYVI LTYI LLLNMLI ALMGETVNKI AQESKNIWK 699 

650 ™*YUtKA , | | | | | | | I II II I M M I II M M I I II I I I I I I II I II 

649 FTENYDFI^VFIILLLAYVILTYILLLNMLIA^ 698 

749 
748 
799 
798 



700 



LQRAITILDTEKSFLKCMRKAFRSGKLLQVGYTPDGKDDYRWCFRVDEVN 

mm IIIIMMM III I Ml I II MM 1:1 Ml HI HUM 

699 LQRAITILDTEKSFLKCMRKAFRSGKLLQVGFTPDGKDDYRWCFRVDEVN 

750 WTTWNTNVGI INEDPGNCEGVKRTLSFSLRSSRVSGRHW^FALyPLLRE 
IIIIIIIIIIMIIM IN MIMIIIMM I llll-lllllllllll: 
74 9 i™ T ^GIlNEDPGNCEGVKRTLSFSLRSGRVSGRNWKNFALVPLLRD 

ASARDRQSAQPEEVYLRQFSGSLKPEDAEVFKSPAASGEK 839 
|| m . I III I : :. II II II M I I II HI 

799 ASTRC 



800 



IDRHATQQEEVQLKHYTGSLKPEDAEVFKDSMVPGEK 838 
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CLUSTAL W (1.74) multiple sequence alignment 



human VR2 .alt 
humanVR2 



KTS PSSS PVFRLETLDGGQEDGSEADRGKLiDFGSGLiPPMESQPQGEDRKFAPQI R VNLNY 



human VR2 .alt 
humanVR2 



RKGTGASQPDPNRFT>RDRI*FI^VSRGVPEDIAGLPEyijSKTSKYl<TDSEYTEGSTGKTCL 



human VR2 .alt 
human VR2 



MKAVLNIiKDGVNAC I LPLLQ I DRDSGNPQPLVNAQCTDDYYRGHSAIjH I A I EKRSLQCVK 



human VR2 .alt 
human VR2 



GRFFQKGOGTCFYFX5ELPLSLAACTKCWDWSYLLENPHQPASLQA 

LLVENGANVHARACGRFFQKGQGTCFYFGELPI^IJ^CT^ 



humanVR2 .alt 
humanVR2 



TDSQGNTVLHALVM I SDNSAEIN I ALVTSMYDG LLQAGARLCPTVQLiED I RNLQDLTPLKL 
TDS QGNTVLHAL VM I SDNSAEN I ALVTSMYDG LLQAGARLCPTVQLiE D I RNLQDLTPLKL 



human VR2 . alt 
human VR2 



AAKEGKI E I FRH I LQREFSGI^HIjSRKFTEWCYGP VRVSLYDLAS VDS CEENS VLEI IAF 
AAKEGKIEIFRHIIiQREFSGI*SHl»SRKFTEWCTGPVR^ 



human VR2 .alt 
human VR2 



HCKSPHRHRMVVLiEPL/nCLLQAKWDLLI PKFKUN FIX^NLI YMFIFTAVAYHQPTLKKQAA 
HC^PHRHRMVVIjEPLiria-IiQAKWDIJLiIPKFFXjNFI 



human VR2 .alt 
humanVR2 



PHLKAEVGNSMIJjTGHILIIjI'GGIYI^VGQ^ 

PHI*KAEVGNSMIiLTGHIIiIU/5G IYLLVGQIAfYFWRRHVFIHI S FIDS YFEII*FLFQAI*L 



human VR2 . alt 
humanVR2 



TWSQVLCFIAI EWYLPLLVSALVIXsrtflJnLiLYYTRGFQHTG I YSVMIQ 



humanVR2 .alt 
humanVR2 



I YliVFliFGFAVAIjVSIjSQEAWRPEAPTG PNATESVQ PMEGQEDEGNGAQYRG I LEASLEL 



human VR2 .alt 
bu=*nVR2 - 



. FKFTIGMGKIAFUEQIjHFR<B4VIJjI*LIAYVI^ IW 



humanVR2 . alt 
humanVR2 



- - KKMSVLEhffiNGYWKCRKKQRA^^ 
KUQKAISVIjEMENGYHHCRKKQRAGVMLTVCT 



humanVR2,alt 
humanVR2 



I/3EDPSGAGVPRTIjENPVIJ^PPKEDBIX3ASBBNYVPVQLI<)SN 
IX^DPSGAGVPRTIiENPVIASPPKEDEDGASEENYVPVQtJX^SN 



FKSGEE 21 
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4JB 



D Hydrophilteity Plot - Kyte-Doolittle 



D Hydrophob tatty Plot - Hopp-Woods 
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□ Antigenic Index - Jameson-Wolf 




a Surface Probability Plot - Emlnl 
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Protein Family / Domain Matches, HMMer version 2 

Searching for complete domains 

hmmpfam - search a single seq against HMM database 
HMMER 2.1-1 (Dec 1998) 

Copyright (C) 1992-1998 Washington University School of Medicine 

HMMER is freely distributed under the GNU General Public License (GPL) . 

*il'e: /prod/ddm/seqanal/PFAM/pfam4 ,2/Pfam 

Sequence f ale : /usr/ns-home/docs/seqanal/orfanal/oa-script . 18670 seq 

Ouory: hVR-1 

Scores for sequence family classification (score includes all domains) - 

^!!r.^!^ n Score E- value N 

ank Ank repeat 51.5 

Parsed for domains: 

Model Domain seq-f seq-t hmm-f hmm-t score E-value 

ank 1/3 201 233 1 33 I] 34.4 2.6e-06 

Mk 2/3 248 283 1 33 [J 13.2 2 

* nk 3/3 333 361 1 33 [J 3.4 26 

Alignments of top-scoring domains: 

ank: domain 1 of 3, from 201 to 233: score 34.4, E = 2.6e-06 

* - >nGnTPLHl Aarygnve wklLLehGAdvnart k< - * 
+G+T+LH+A + n+ +v lL+e+GAdv a+ 



1.9e-ll 3 



hVR-I 201 KGQTALH I AI ERRNMALVTLLVENGADVQAAAH 233 

*nk: domain 2 of 3, from 248 to 283: score 13.2, E = 2 

TPLHlAarygnvewklLLe . . . hGAdvn^* 
PL XAa ++++ +vk+LL+++ ♦ Ad+ art 



*->nGnTPLHlAarygnvewklLLe * . ! hGAdvnartk<-* 
hVR 1 oau JL PL + vk+LL+++ ♦ Ad+ art 

nVK-1 248 FGELPLSI^^QIXSrvXFLI^nswQTADISARDS 283 



Mlk! <*«Min 3 of 3, from 333 to 361: score 3.4, E ~ 26 

* - >nGnTPLHlAarygnvewklLLehGAdvnar t k< - * 
+G TPL lAa +g++ v +♦ L+ ++ 
hVR-1 333 KGMTPIAIAAGTGKIGVLAYILQ REIQEP 361 



Pi 
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Protein Family / Domain Matches, HMMer Version 2 

Searching for complete domains . „ 

tattnp£«m - search a single c«x against HMM database 

Sequence £i-Le: /taap/ortenal.SVB.ea ^. 

OOexart FXh21ell 

Scores fox sequence family clarification (ccoro includ^^ll ^«^^^ N 

ItoOal Description. 

— 1 S3. 7 <e-12 3 

OT lr FF00023 Ank repeat 

JKS* £0 ^ST1^-£ J-f^ ~°«* E-value 
— —15 ""'I* « t J 3B 3 1.7^07 

SS SS IS:: I 8 8 

Alignment* of top-ccoring domaS i - * ^ ^ l.7«-07 



194 
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CLUSTAL W ( 1 . 



humanVR2 
hVR2 .altFL 



humanVR2 
hVR2 .altFL 



humanVR2 
hVR2 .altFL 



humanVR2 
hVR2 .altFL 



human VR2 
hVR2 .altFL 



humanVR2 
hVR2 .altFL 



humanVR2 
hVR2 .altFL 



humanVR2 
hVR2. altFL 



human VR 2 
hVR2. altFL 



humanVR2 
hVR2. altFL 



human VR2 
hVR2. altFL 



humanVR2 
hVR2 .altFL 



humanVR2 
hVR2 .altFL 
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74) multiple sequence alignment 

MTSPSSSPVFRLETLDGGQEDGSEADRGKLDFGSGLPPMESQFQGEDRKFAPOtimtxti k 
MTSPSSSPVFRLETLDGGQEDGSEADRGKLDFGSGLPPME 

RKGTGASQPDPNRFDRDRLFNAVSRGVPEDLAGLPEYLSKTSKYLTDSEYTEGSTrvTOT 
RKGTGASQPDPNRFDRDRLFNAVSRGVPEDIAGLPEYLSKTSKYLTDSEYTEGSTGKTCL 

MKAVLNLKDGVNACILPLLQIDRDSGNPQPLVNAQCTDDYYRGHSALHIAIEKRSLOCVJC 
MKAVLNLKDGVNACTLPLLQIDRDSGNPQPLVNAQCTDDYYRGHSALHIAIEKRSLQCVK 

LLVENGANVHARACGRFFQKGQGTCFYFGELPLSLAACTKOWDWSYLLENPHOPASLOA 
LLVENGANVHARACGRFFQKGQGTCFYFGELPLSIAACTKOWDVVSYLLENPHQPASLQA 

TDSQGNTVLHAL VM I S DNS AEN I ALVTSMYDGLLQAGARLCPTVQLED I RNLQDLTPLKL 
TDSQGNTVLHALVMI SDNSAEN I ALVTSMYDGLLQAGARLCPTVQLED I RNLODLTPLKL 

******************* ********************************* r******r 

AAKEGKIEIFRHILQREFSGLSHLSRKFTEWCYGPVRVSLYDLASVDSCEENSVLEIIAF 

AAKEGKIEI FRHI LQREFSGLSHLSRKFTEWCYGPVRVSLYDLAS VDSCEENSVLEI IAF 
*** + *** + * + ***** + ** + + * + ** + + * + * + + * + it+i r + + * + + + + itititit i t i tif i [ i t1(ititiritititit 

HCKS PHRHRMWLEPLNKLLQAKWDLLI PKFFLNFLCNLI YMFI FTAVAYHQPTLKKQAA 
HCKSPHRHRMWLEPI^KLLQAKWDLLIPKFFLNFLCNLIYMFIFTAVAYHQPTLKKQAA 

PHLKAE VGNSMLLTGH I L I LLGG I YLLVGQLWYFWRRHVF IWISFIDSYFEI LFLFQALL 
PHLKAEVGNSMLLTGHI LI LLGGI YLLVGQLWYFWRRHVFI WI SFIDSYFEILFLFQALL 

TVVSQ\njCFIAIEWYLPLLVSALVIX;WLNLLYYTRGFQHTGIYSVMIQKVILRDLLRFLL 

TVVSQVLCFLAIEWYLPLLVSALVLGWLNLLYYTRGFQHTGIYSVMIQK 

*******************************^**********. fr1 ^^^^, Jt 

I YLVFLFGFAVALVSLSQEAWRPEAPTGPNATES VQPMEGQEDEGNGAQYRGI LEASLEL 

FKFTI GMGELAFQEQLHFRGMVLLLLLAYVLLTYI LLUOMLIALMSETVNSVATDSWSIW 

KLQKA I S VLEMENG YWWCRKKQRAGVMLTVGTKPDGS PDERWCFRVEEVNWASWEQTLPT 
KA IS VLEMENG YWWCRKKQRAGVMLTVGTKPDGS PDERWCFRVEEVNWASWEQTLPT 

LCEDPSGAGVPRTLENPVLASPPKEDEDGASEENYVPVQLLQSN 
LCEDPSGAGVPRTLENPVLASPPKEDEDGASEENYVPVQLLQSN 
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SEQUENCE LISTING 
<110> MILLENNIUM PHARMACEUTICALS, INC. 

<120> NOVEL MEMBERS OF THE CAPSAICIN /VANILLOID RECEPTOR 
FAMILY OF PROTEINS AND USES THEREOF 

<130> MNI-062CP2PC 

<140> 
<141> 

<150> 60/108,322 
<151> 1998-11-13 

<150> 60/114,078 
<151> 1998-12-28 

<150> 09/258,633 
<151> 1999-02-26 

<150> 09/421,134 
<151> 1999-10-19 

<160> 20 

<170> Patentln Ver . 2.0 

<210> 1 
<211> 3909 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (1113) . . (3629) 
<400> 1 



gtgagcgcaa 


cgcactgcgg 


gcagtgagcg 


caacgcactg 


cgggcagtga 


gcgcaacgca 


60 


ctgcgggcag 


tgagcgcaac 


gcactgcggg 


cagtgagcgc 


aacgcactgc 


gggcagtgag 


120 


cgcaacgcac 


tgcgggcagt 


gagcgcaacg 


cactgcgggc 


agtgagcgca 


acgcacttgc 


180 


gggcagtgag 


cgcaacgcac 


tgcgggcagt 


gagcgcaacg 


cactgcgggc 


agtgagcgca 


240 


acgcactgcg 


ggcagtgagc 


gcaacgcact 


gcgggcagtg 


agcgcaacgc 


actgcgggca 


300 


gtgagcgcaa 


cgcactgcgg 


gcagtgagcg 


caacgcactg 


cgggcagtga 


gcgcaacgca 


360 


ctgcgggcag 


tgagcgcaac 


gcactgcggg 


cagtgagcgc 


aacgcactgc 


gggcagtgag 


420 


cgcaacgcac 


ttaatgtgag 


ttagctcact 


cattaggcac 


cccaggcttt 


acactttatg 


480 


cttccggctc 


gtatgttgtg 


tggaattgtg 


agcggataac 


aatttcacac 


aggaaacagc 


540 


tatgaccatg 


attacgccaa 


gctctaatac 


gactcactat 


agggaaagct 


ggtacgcctg 


600 
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caggtaccgg tccggaattc ccgggtcgac ccacgcgtcc gaaaacacac ctctctgctg 660 

tgggaagact gtgcaatggc acagccgcag agcttggttt gggaggttga agtgctctgg 720 

ggagaattcg tagatcatcc tcagaaaagc cttgccctgg tgttctacca gaaaaacgtc 780 

tcccaatcac ccagaaaagc tgtccacagt agtcccccct tatccacggg tgtcactttc 840 

catgggttca gttatttgcg gtcaaccacg gtctgccaat attaaatgga aaattcttca 900 

aacagttccc aagttttccc ttgtgcattg ttctgagcag tgtgatgaag agtctctgcc 960 

gtgccatctg ggatgcaaac cgtccctgtg tcccccacgt ccaggccgta gatgctcccc 1020 

gccggtcagt cacttagtcg tcagatcgcc cgtcctggta tcacagtgct tctgttcagg 1080 

ttgcacactg ggccacagag gatccagcaa gg atg aag aaa tgg age age aca 1133 

Met Lys Lys Trp Ser Ser Thr 
1 5 

gac ttg ggg aca get gcg gac cca etc caa aag gac ace tgc cca gac 1181 
Asp Leu Gly Thr Ala Ala Asp Pro Leu Gin Lys Asp Thr Cys Pro Asp 
10 15 20 

ccc ctg gat gga gac cct aac tec agg cca cct cca gee aag ccc cag 1229 
Pro Leu Asp Gly Asp Pro Asn Ser Arg Pro Pro Pro Ala Lys Pro Gin 
25 30 35 

etc ccc acg gee aag age cgc acc egg etc ttt ggg aag ggt gac teg 1277 
Leu Pro Thr Ala Lys Ser Arg Thr Arg Leu Phe Gly Lys Gly Asp Ser 
40 45 50 55 

gag gag get ttc ccg gtg gat tgc ccc cac gag gaa ggt gag ttg gac 1325 
Glu Glu Ala Phe Pro Val Asp Cys Pro His Glu Glu Gly Glu Leu Asp 
60 65 70 

tec tgc ccg acc ate aca gtc age cct gtt ate acc ate cag agg cca 1373 
Ser Cys Pro Thr lie Thr Val Ser Pro Val lie Thr lie Gin Arg Pro 
75 80 85 

gga gac ggc ccc acc ggt gee agg ctg ctg tec cag gac tct gtc gee 1421 
Gly Asp Gly Pro Thr Gly Ala Arg Leu Leu Ser Gin Asp Ser Val Ala 
90 95 100 

gee age acc gag aag acc etc agg etc tat gat cgc agg agt ate ttt 1469 
Ala Ser Thr Glu Lys Thr Leu Arg Leu Tyr Asp Arg Arg Ser lie Phe 
105 110 115 

gaa gee gtt get cag aat aac tgc cag gat ctg gag age ctg ctg etc 1517 
Glu Ala Val Ala Gin Asn Asn Cys Gin Asp Leu Glu Ser Leu Leu Leu 
120 125 130 135 

ttc ctg cag aag age aag aag cac etc aca gac aac gag ttc aaa gac 1565 
Phe Leu Gin Lys Ser Lys Lys His Leu Thr Asp Asn Glu Phe Lys Asp 
140 145 150 

cct gag aca ggg aag acc tgt ctg ctg aaa gee atg etc aac ctg cac 1613 
Pro Glu Thr Gly Lys Thr Cys Leu Leu Lys Ala Met Leu Asn Leu His 
155 160 165 
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gac gga cag aac acc acc ate ccc ctg, etc ctg gag ate gcg egg caa 
Asp Gly Gin Asn Thr Thr lie Pro Leu Leu Leu Glu lie Ala Arg Gin 
170 175 180 

acg gac age ctg aag gag ctt gtc aac gec age tac acg gac age tac 
Thr Asp Ser Leu Lys Glu Leu Val Asn Ala Ser Tyr Thr Asp Ser Tyr 
185 190 195 



330 335 340 

acc ggg aag ate ggg gtc ttg gec tat att etc cag egg gag ate cag 

Thr Gly Lys lie Gly Val Leu Ala Tyr lie Leu Gin Arg Glu lie Gin 
345 350 355 



ggg ccc gtg cac tec teg ctg tac gac ctg tec tgc ate gac acc tgc 
Gly Pro Val His Ser Ser Leu Tyr Asp Leu Ser Cys lie Asp Thr Cys 
380 385 390 



1661 



1709 



tac aag ggc cag aca gca ctg cac ate gec ate gag aga cgc aac atg 1757 
Tyr Lys Gly Gin Thr Ala Leu His He Ala He Glu Arg Arg Asn Met 
200 205 210 215 

gee ctg gtg acc etc ctg gtg gag aac gga gca gac gtc cag get gcg 1805 
Ala Leu Val Thr Leu Leu Val Glu Asn Gly Ala Asp Val Gin Ala Ala 
220 225 230 

gec cat ggg gac ttc ttt aag aaa acc aaa ggg egg cct gga ttc tac 
Ala His Gly Asp Phe Phe Lys Lys Thr Lys Gly Arg Pro Gly Phe Tyr 
235 240 245 

ttc ggt gaa ctg ccc ctg tec ctg gec gcg tgc acc aac cag ctg ggc 
Phe Gly Glu Leu Pro Leu Ser Leu Ala Ala Cys Thr Asn Gin Leu Gly 
250 255 260 

ate gtg aag ttc ctg ctg cag aac tec tgg cag acg gee gac ate age 
He Val Lys Phe Leu Leu Gin Asn Ser Trp Gin Thr Ala Asp He Ser 
265 270 275 

gec agg gac teg gtg ggc aac acg gtg ctg cac gee ctg gtg gag gtg 
Ala Arg Asp Ser Val Gly Asn Thr Val Leu His Ala Leu Val Glu Val 
280 285 290 295 

gee gac aac acg gec gac aac acg aag ttt gtg acg age atg tac aat 
Ala Asp Asn Thr Ala Asp Asn Thr Lys Phe Val Thr Ser Met Tyr Asn 
300 305 310 

gag att ctg atg ctg ggg gec aaa ctg cac ccg acg ctg aag ctg gag 
Glu He Leu Met Leu Gly Ala Lys Leu His Pro Thr Leu Lys Leu Glu 
315 320 325 

gag etc acc aac aag aag gga atg acg ccg ctg get ctg gca get ggg 2141 
Glu Leu Thr Asn Lys Lys Gly Met Thr Pro Leu Ala Leu Ala Ala Gly 



1853 



1901 



1949 



1997 



2045 



2093 



2189 



gag ccc gag tgc agg cac ctg tec agg aag ttc acc gag tgg gee tac 2237 
Glu Pro Glu Cys Arg His Leu Ser Arg Lys Phe Thr Glu Trp Ala Tyr 
360 365 370 375 



2285 



gag aag aac teg gtg ctg gag gtg ate gec tac age age age gag acc 2333 
Glu Lys Asn Ser Val Leu Glu Val lie Ala Tyr Ser Ser Ser Glu Thr 
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395 400 405 

cct aat cgc cac gac atg etc ttg gtg gag ccg ctg aac cga etc ctg 2381 

Pro Asn Arg His Asp Met Leu Leu Val Glu Pro Leu Asn Arg Leu Leu 
410 415 420 



cag gac aag tgg gac aga ttc gtc aag cgc ate ttc tac ttc aac ttc 2429 
Gin Asp Lys Trp Asp Arg Phe Val Lys Arg lie Phe Tyr Phe Asn Phe 
425 430 435 

ctg gtc tac tgc ctg tac atg ate ate ttc acc atg get gec tac tac 2477 
Leu Val Tyr Cys Leu Tyr Met lie lie Phe Thr Met Ala Ala Tyr Tyr 
440 445 450 455 

agg ccc gtg gat ggc ttg cct ccc ttt aag atg gaa aaa att gga gac. 2525 
Arg Pro Val Asp Gly Leu Pro Pro Phe Lys Met Glu Lys lie Gly Asp 
460 465 470 

tat ttc cga gtt act gga gag ate ctg tct gtg tta gga gga gtc tac 2573 
Tyr Phe Arg Val Thr Gly Glu lie Leu Ser Val Leu Gly Gly Val Tyr 
475 480 485 

ttc ttt ttc cga ggg att cag tat ttc ctg cag agg egg ccg teg atg 2621 
Phe Phe Phe Arg Gly lie Gin Tyr Phe Leu Gin Arg Arg Pro Ser Met 
490 495 500 

aag acc ctg ttt gtg gac age tac agt gag atg ctt ttc ttt ctg cag 2669 
Lys Thr Leu Phe Val Asp Ser Tyr Ser Glu Met Leu Phe Phe Leu Gin 
505 510 515 

tea ctg ttc atg ctg gee acc gtg gtg ctg tac ttc age cac etc aag 2717 
Ser Leu Phe Met Leu Ala Thr Val Val Leu Tyr Phe Ser His Leu Lys 
520 525 530 535 

gag tat gtg get tec atg gta ttc tec ctg gee ttg ggc tgg acc aac 2765 
Glu Tyr Val Ala Ser Met Val Phe Ser Leu Ala Leu Gly Trp Thr Asn 
540 545 550 

atg etc tac tac acc cgc ggt ttc cag cag atg ggc ate tat gee gtc 2813 
Met Leu Tyr Tyr Thr Arg Gly Phe Gin Gin Met Gly lie Tyr Ala Val 
555 560 565 

atg ata gag aag atg ate ctg aga gac ctg tgc cgt ttc atg ttt gtc 2861 
Met lie Glu Lys Met lie Leu Arg Asp Leu Cys Arg Phe Met Phe Val 
570 575 580 

tac ate gtc ttc ttg ttc ggg ttt tec aca gcg gtg gtg acg ctg att 2909 
Tyr lie Val Phe Leu Phe Gly Phe Ser Thr Ala Val Val Thr Leu lie 
585 590 595 

gaa gac ggg aag aat gac tec ctg ccg tct gag tec acg teg cac agg 2957 
Glu Asp Gly Lys Asn Asp Ser Leu Pro Ser Glu Ser Thr Ser His Arg 
600 605 610 615 

tgg egg ggg cct gee tgc agg ccc ccc gat age tec tac aac age ctg 3005 
Trp Arg Gly Pro Ala Cys Arg Pro Pro Asp Ser Ser Tyr Asn Ser Leu 
620 625 630 
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tac tec acc tgc ctg gag ctg ttc aag ttc acc ate ggc atg ggc gac 3053 

Tyr Ser Thr Cys Leu Glu Leu Phe Lys Phe Thr He Gly Met Gly Asp 
635 640 645 

ctg gag ttc act gag aac tat gac ttc aag get gtc ttc ate ate ctg 3101 
Leu Glu Phe Thr Glu Asn Tyr Asp Phe Lys Ala Val Phe He He Leu 
650 655 660 



ctg ctg gec tat gta att etc acc tac ate etc ctg etc aac atg etc 
Leu Leu Ala Tyr Val He Leu Thr Tyr He Leu Leu Leu Asn Met Leu 
665 670 675 



age :tc ctt aag tgc atg agg aag gee ttc cgc tea ggc aag ctg ctg 
Ser rhe Leu Lys Cys Met Arg Lys Ala Phe Arg Ser Gly Lys Leu Leu 
715 720 725 



get cag ccc gag gaa gtt tat ctg cga cag ttt tea ggg tct ctg aag 
Ala Gin Pro Glu Glu Val Tyr Leu Arg Gin Phe Ser Gly Ser Leu Lys 
810 815 820 



3149 



ate gee etc atg ggt gag act gtc aac aag ate gca cag gag age aag 3197 
He Ala Leu Met Gly Glu Thr Val Asn Lys He Ala Gin Glu Ser Lys 
680 685 690 695 

aac ate tgg aag ctg cag aga gee ate acc ate ctg gac acg gag aag 3245 
Asn He Trp Lys Leu Gin Arg Ala He Thr He Leu Asp Thr Glu Lys 
700 705 710 



3293 



cag gtg ggg tac aca cct gat ggc aag gac gac tac egg tgg tgc ttc 3341 

Gin Val Gly Tyr Thr Pro Asp Gly Lys Asp Asp Tyr Arg Trp Cys Phe 

730 735 740 

agg gtg gac gag gtg aac tgg acc acc tgg aac acc aac gtg ggc ate 3389 

Arg Val Asp Glu Val Asn Trp Thr Thr Trp Asn Thr Asn Val Gly He 

745 750 755 

ate aac gaa gac ccg ggc aac tgt gag ggc gtc aag cgc acc ctg age 3437 

He Asn Glu Asp Pro Gly Asn Cys Glu Gly Val Lys Arg Thr Leu Ser 

760 765 770 775 

ttc tec ctg egg tea age aga gtt tea ggc aga cac tgg aag aac ttt 3485 

Phe Ser Leu Arg Ser Ser Arg Val Ser Gly Arg His Trp Lys Asn Phe 

780 785 790 

gec ctg gtc ccc ctt tta aga gag gca agt get cga gat agg cag tct 3533 

Ala Leu Val Pro Leu Leu Arg Glu Ala Ser Ala Arg Asp Arg Gin Ser 

795 800 805 



3581 



3629 



cca gag gac get gag gtc ttc aag agt cct gee get tec ggg gag aag 
Pro Glu Asp Ala Glu Val Phe Lys Ser Pro Ala Ala Ser Gly Glu Lys 
825 830 835 

tgaggacgtc aegcagacag cactgtcaac actgggcett aggagacccc gttgecaegg 3689 

ggggctgctg agggaacacc agtgctctgt cagcagcctg gcctggtctg tgcctgccca 374 9 

gcatgttccc aaatctgtgc tggacaagct gtgggaagcg ttcttggaag catggggagt 3809 
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gatgtacatc caaccgtcac tgtccccaag tgaatctcct aacagacttt caggttttta 3869 
ctcactttac taaaaaaaaa aaaaaaaggg cggccgctta 3909 



<210> 2 
<211> 839 
<212> PRT 

<213> Homo sapiens 
<400> 2 

Met Lys Lys Trp Ser Ser Thr Asp Leu Gly Thr Ala Ala Asp Pro Leu 
15 10 15 

Gin Lys Asp Thr Cys Pro Asp Pro Leu Asp Gly Asp Pro Asn Ser Arg 
20 25 30 

Pro Pro Pro Ala Lys Pro Gin Leu Pro Thr Ala Lys Ser Arg Thr Arg 
35 40 45 

Leu Phe Gly Lys Gly Asp Ser Glu Glu Ala Phe Pro Val Asp Cys Pro 
50 55 60 

His Glu Glu Gly Glu Leu Asp Ser Cys Pro Thr lie Thr Val Ser Pro 
65 70 75 80 

Val lie Thr lie Gin Arg Pro Gly Asp Gly Pro Thr Gly Ala Arg Leu 
85 90 95 

Leu Ser Gin Asp Ser Val Ala Ala Ser Thr Glu Lys Thr Leu Arg Leu 
100 105 110 

Tyr Asp Arg Arg Ser lie Phe Glu Ala Val Ala Gin Asn Asn Cys Gin 
115 120 125 

Asp Leu Glu Ser Leu Leu Leu Phe Leu Gin Lys Ser Lys Lys His Leu 
130 135 140 

Thr Asp Asn Glu Phe Lys Asp Pro Glu Thr Gly Lys Thr Cys Leu Leu 
145 150 155 160 

Lys Ala Met Leu Asn Leu His Asp Gly Gin Asn Thr Thr lie Pro Leu 
165 170 175 

Leu Leu Glu lie Ala Arg Gin Thr Asp Ser Leu Lys Glu Leu Val Asn 
180 185 190 

Ala Ser Tyr Thr Asp Ser Tyr Tyr Lys Gly Gin Thr Ala Leu His lie 
195 200 205 

Ala lie Glu Arg Arg Asn Met Ala Leu Val Thr Leu Leu Val Glu Asn 
210 215 220 
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Gly Ala Asp Val 
225 

Lys Gly Arg Pro 



Ala Cys Thr Asn 
260 

Trp Gin Thr Ala 
275 



Gin Ala Ala Ala 
230 

Gly Phe Tyr Phe 
245 

Gin Leu Gly lie 



Asp lie Ser Ala 
280 



His Gly Asp Phe 
235 

Gly Glu Leu Pro 
250 

Val Lys Phe Leu 
265 

Arg Asp Ser Val 



Phe Lys Lys Thr 
240 

Leu Ser Leu Ala 
255 

Leu Gin Asn Ser 
270 

Gly Asn Thr Val 
285 



Leu His Ala Leu 
290 

Phe Val Thr Ser 
305 

His Pro Thr Leu 



Pro Leu Ala Leu 
340 

lie Leu Gin Arg 
355 

Lys Phe Thr Glu 
370 

Leu Ser Cys lie 
385 

Ala Tyr Ser Ser 



Glu Pro Leu Asn 
420 

Arg lie Phe Tyr 
435 

Phe Thr Met Ala 
450 

Lys Met Glu Lys 
4 65 

Ser Val Leu Gly 



Leu Gin Arg Arg 
500 

Glu Met Leu Phe 
515 

Leu Tyr Phe Ser 



Val Glu Val Ala 
295 

Met Tyr Asn Glu 
310 

Lys Leu Glu Glu 
325 

Ala Ala Gly Thr 



Glu lie Gin Glu 
360 

Trp Ala Tyr Gly 
375 

Asp Thr Cys Glu 
390 

Ser Glu Thr Pro 
405 

Arg Leu Leu Gin 



Phe Asn Phe Leu 
440 

Ala Tyr Tyr Arg 
455 

lie Gly Asp Tyr 
470 

Gly Val Tyr Phe 
485 

Pro Ser Met Lys 



Phe Leu Gin Ser 
520 

His Leu Lys Glu 



Asp Asn Thr Ala 
300 

lie Leu Met Leu 
315 

Leu Thr Asn Lys 
330 

Gly Lys lie Gly 
345 

Pro Glu Cys Arg 



Pro Val His Ser 
380 

Lys Asn Ser Val 
395 

Asn Arg His Asp 
410 

Asp Lys Trp Asp 
425 

Val Tyr Cys Leu 



Pro Val Asp Gly 
460 

Phe Arg Val Thr 
475 

Phe Phe Arg Gly 
4 90 

Thr Leu Phe Val 
505 

Leu Phe Met Leu 



Tyr Val Ala Ser 



Asp Asn Thr Lys 



Gly Ala Lys Leu 
320 

Lys Gly Met Thr 
335 

Val Leu Ala Tyr 
350 

His Leu Ser Arg 
365 

Ser Leu Tyr Asp 



Leu Glu Val lie 
400 

Met Leu Leu Val 
415 

Arg Phe Val Lys 
430 

Tyr Met lie lie 
445 

Leu Pro Pro Phe 



Gly Glu lie Leu 
480 

lie Gin Tyr Phe 
495 

Asp Ser Tyr Ser 
510 

Ala Thr Val Val 
525 

Met Val Phe Ser 
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530 535 540 

Leu Ala Leu Gly Trp Thr Asn Met Leu Tyr Tyr Thr Arg Gly Phe Gin 
545 550 555 560 

Gin Met Gly lie Tyr Ala Val Met lie Glu Lys Met lie Leu Arg Asp 
565 570 575 

Leu Cys Arg Phe Met Phe Val Tyr lie Val Phe Leu Phe Gly Phe Ser 
580 585 590 

Thr Ala Val Val Thr Leu lie Glu Asp Gly Lys Asn Asp Ser Leu Pro 
595 600 605 



Ser Glu Ser Thr Ser His Arg Trp Arg Gly Pro Ala Cys Arg Pro Pro 
610 615 620 

Asp Ser Ser Tyr Asn Ser Leu Tyr Ser Thr Cys Leu Glu Leu Phe Lys 
625 630 635 640 

Phe Thr lie Gly Met Gly Asp Leu Glu Phe Thr Glu Asn Tyr Asp Phe 
645 650 655 

Lys Ala Val Phe lie lie Leu Leu Leu Ala Tyr Val lie Leu Thr Tyr 
660 665 670 

lie Leu Leu Leu Asn Met Leu lie Ala Leu Met Gly Glu Thr Val Asn 
675 680 685 

Lys lie Ala Gin Glu Ser Lys Asn lie Trp Lys Leu Gin Arg Ala lie 
690 695 700 

Thr lie Leu Asp Thr Glu Lys Ser Phe Leu Lys Cys Met Arg Lys Ala 
705 710 715 720 

Phe Arg Ser Gly Lys Leu Leu Gin Val Gly Tyr Thr Pro Asp Gly Lys 
725 730 735 

Asp Asp Tyr Arg Trp Cys Phe Arg Val Asp Glu Val Asn Trp Thr Thr 
740 745 750 

Trp Asn Thr Asn Val Gly lie lie Asn Glu Asp Pro Gly Asn Cys Glu 
755 760 765 

Gly Val Lys Arg Thr Leu Ser Phe Ser Leu Arg Ser Ser Arg Val Ser 
770 775 780 

Gly Arg His Trp Lys Asn Phe Ala Leu Val Pro Leu Leu Arg Glu Ala 
785 790 795 800 

Ser Ala Arg Asp Arg Gin Ser Ala Gin Pro Glu Glu Val Tyr Leu Arg 
805 810 815 

Gin Phe Ser Gly Ser Leu Lys Pro Glu Asp Ala Glu Val Phe Lys Ser 
820 825 830 

Pro Ala Ala Ser Gly Glu Lys 
835 
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<210> 3 

<211> 2517 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . (2517) 

<400> 3 

atg aag aaa tgg age age aca gac ttg ggg aca get gcg gac cca etc 48 

Met Lys Lys Trp Ser Ser Thr Asp Leu Gly Thr Ala Ala Asp Pro Leu 
15 10 15 



caa aag gac ace tgc cca gac ccc ctg gat gga gac cct aac tec agg 96 
Gin Lys Asp Thr Cys Pro Asp Pro Leu Asp Gly Asp Pro Asn Ser Arg 
20 25 30 

cca cct cca gee aag ccc cag etc ccc acg gee aag age cgc acc egg 144 
Pro Pro Pro Ala Lys Pro Gin Leu Pro Thr Ala Lys Ser Arg Thr Arg 
35 40 45 

etc ttt ggg aag ggt gac teg gag gag get ttc ccg gtg gat tgc ccc 192 
Leu Phe Gly Lys Gly Asp Ser Glu Glu Ala Phe Pro Val Asp Cys Pro 
50 55 60 

cac gag gaa ggt gag ttg gac tec tgc ccg acc ate aca gtc age cct 240 
His Glu Glu Gly Glu Leu Asp Ser Cys Pro Thr lie Thr Val Ser Pro 
65 70 75 80 

gtt ate acc ate cag agg cca gga gac ggc ccc acc ggt gec agg ctg 288 
Val lie Thr He Gin Arg Pro Gly Asp Gly Pro Thr Gly Ala Arg Leu 
85 90 95 

ctg tec cag gac tct gtc gee gee age acc gag aag acc etc agg etc 336 
Leu Ser Gin Asp Ser Val Ala Ala Ser Thr Glu Lys Thr Leu Arg Leu 
100 105 HO 



tat gat cgc agg agt ate ttt gaa gee gtt get cag aat aac tgc cag 
Tyr Asp Arg Arg Ser He Phe Glu Ala Val Ala Gin Asn Asn Cys Gin 
115 120 125 



384 



gat ctg gag age ctg ctg etc ttc ctg cag aag age aag aag cac etc 432 
Asp Leu Glu Ser Leu Leu Leu Phe Leu Gin Lys Ser Lys Lys His Leu 
130 135 140 



aca gac aac gag ttc aaa gac cct gag 
Thr Asp Asn Glu Phe Lys Asp Pro Glu 
145 150 

aaa gee atg etc aac ctg cac gac gga 
Lys Ala Met Leu Asn Leu His Asp Gly 
165 

etc ctg gag ate gcg egg caa acg gac 



aca ggg aag acc tgt ctg ctg 480 
Thr Gly Lys Thr Cys Leu Leu 
155 160 

cag aac acc acc ate ccc ctg 528 
Gin Asn Thr Thr lie Pro Leu 
170 175 

age ctg aag gag ctt gtc aac 576 
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Leu Leu Glu lie Ala Arg Gin Thr Asp Ser Leu Lys Glu Leu Val Asn 
180 185 190 

gcc age tac acg gac age tac tac aag ggc cag aca gca ctg cac ate 624 
Ala Ser Tyr Thr Asp Ser Tyr Tyr Lys Gly Gin Thr Ala Leu His lie 
195 200 205 

gcc ate gag aga cgc aac atg gcc ctg gtg acc etc ctg gtg gag aac 672 
Ala lie Glu Arg Arg Asn Met Ala Leu Val Thr Leu Leu Val Glu Asn 
210 215 220 

gga gca gac gtc cag get gcg gcc cat ggg gac ttc ttt aag aaa acc 720 
Gly Ala Asp Val Gin Ala Ala Ala His Gly Asp Phe Phe Lys Lys Thr 
225 230 235 240 

aaa ggg egg cct gga ttc tac ttc ggt gaa ctg ccc ctg tec ctg gcc 768 
Lys Gly Arg Pro Gly Phe Tyr Phe Gly Glu Leu Pro Leu Ser Leu Ala 
245 250 255 

gcg tgc acc aac cag ctg ggc ate gtg aag ttc ctg ctg cag aac tec 816 
Ala Cys Thr Asn Gin Leu Gly lie Val Lys Phe Leu Leu Gin Asn Ser 
260 265 270 

tgg cag acg gcc gac ate age gcc agg gac teg gtg ggc aac acg gtg 864 
Trp Gin Thr Ala Asp lie Ser Ala Arg Asp Ser Val Gly Asn Thr Val 
275 280 285 

ctg cac gcc ctg gtg gag gtg gcc, gac aac acg gcc gac aac acg aag 912 
Leu His Ala Leu Val Glu Val Ala Asp Asn Thr Ala Asp Asn Thr Lys 
290 295 300 

ttt gtg acg age atg tac aat gag att ctg atg ctg ggg gcc aaa ctg 960 
Phe Val Thr Ser Met Tyr Asn Glu lie Leu Met Leu Gly Ala Lys Leu 
305 310 315 320 

cac ccg acg ctg aag ctg gag gag etc acc aac aag aag gga atg acg 1008 
His Pro Thr Leu Lys Leu Glu Glu Leu Thr Asn Lys Lys Gly Met Thr 
325 330 335 

ccg ctg get ctg gca get ggg acc ggg aag ate ggg gtc ttg gcc tat 1056 
Pro Leu Ala Leu Ala Ala Gly Thr Gly Lys lie Gly Val Leu Ala Tyr 
340 345 350 

att etc cag egg gag ate cag gag ccc gag tgc agg cac ctg tec agg 1104 
lie Leu Gin Arg Glu lie Gin Glu Pro Glu Cys Arg His Leu Ser Arg 
355 360 365 

aag ttc acc gag tgg gcc tac ggg ccc gtg cac tec teg ctg tac gac 1152 
Lys Phe Thr Glu Trp Ala Tyr Gly Pro Val His Ser Ser Leu Tyr Asp 
370 375 380 

ctg tec tgc ate gac acc tgc gag aag aac teg gtg ctg gag gtg ate 1200 
Leu Ser Cys lie Asp Thr Cys Glu Lys Asn Ser Val Leu Glu Val lie 
385 390 395 400 

gcc tac age age age gag acc cct aat cgc cac gac atg etc ttg gtg 1248 
Ala Tyr Ser Ser Ser Glu Thr Pro Asn Arg His Asp Met Leu Leu Val 
405 410 415 
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gag ccg ctg aac cga etc ctg cag gac aag tgg gac aga ttc gtc aag 1296 
Glu Pro Leu Asn Arg Leu Leu Gin Asp Lys Trp Asp Arg Phe Val Lys 
420 425 430 

cgc ate ttc tac ttc aac ttc ctg gtc tac tgc ctg tac atg ate ate 1344 
Arg lie Phe Tyr Phe Asn Phe Leu Val Tyr Cys Leu Tyr Met lie lie 
435 440 445 

ttc acc atg get gec tac tac agg ccc gtg gat ggc ttg cct ccc ttt 
Phe Thr Met Ala Ala Tyr Tyr Arg Pro Val Asp Gly Leu Pro Pro Phe 
450 455 460 

aag atg gaa aaa att gga gac tat ttc cga gtt act gga gag ate ctg 
Lys Met Glu Lys lie Gly Asp Tyr Phe Arg Val Thr Gly Glu lie Leu 
465 470 475 480 

tct gtg tta gga gga gtc tac ttc ttt ttc cga ggg att cag tat ttc 1488 
Ser Val Leu Gly Gly Val Tyr Phe Phe Phe Arg Gly He Gin Tyr Phe 
485 490 495 



ctg cag agg egg ccg teg atg aag acc ctg ttt gtg gac age tac agt 
Leu Gin Arg Arg Pro Ser Met Lys Thr Leu Phe Val Asp Ser Tyr Ser 
500 505 510 

gag atg ctt ttc ttt ctg cag tea ctg ttc atg ctg gee acc gtg gtg 
Glu Met Leu Phe Phe Leu Gin Ser Leu Phe Met Leu Ala Thr Val Val 
515 520 525 

ctg tac ttc age cac etc aag gag tat gtg get tec atg gta ttc tec 
Leu Tyr Phe Ser His Leu Lys Glu Tyr Val Ala Ser Met Val Phe Ser 



530 



535 540 



ctg gec ttg ggc tgg acc aac atg etc tac tac acc cgc ggt ttc cag 
Leu Ala Leu Gly Trp Thr Asn Met Leu Tyr Tyr Thr Arg Gly Phe Gin 
545 550 555 560 



1392 



1440 



1536 



1584 



1632 



1680 



1728 



cag atg ggc ate tat gee gtc atg ata gag aag atg ate ctg aga gac 

Gin Met Gly He Tyr Ala Val Met He Glu Lys Met He Leu Arg Asp 

565 570 575 

ctg tgc cgt ttc atg ttt gtc tac ate gtc ttc ttg ttc ggg ttt tec 1776 

Leu Cys Arg Phe Met Phe Val Tyr He Val Phe Leu Phe Gly Phe Ser 

580 585 590 



aca gcg gtg gtg acg ctg att gaa gac ggg aag aat gac tec ctg ccg 

Thr Ala Val Val Thr Leu lie Glu Asp Gly Lys Asn Asp Ser Leu Pro 

595 600 605 

tct gag tec acg teg cac agg tgg egg ggg cct gee tgc agg ccc ccc 

Ser Glu Ser Thr Ser His Arg Trp Arg Gly Pro Ala Cys Arg Pro Pro 

610 615 620 

gat age tec tac aac age ctg tac tec acc tgc ctg gag ctg ttc aag 

Asp Ser Ser Tyr Asn Ser Leu Tyr Ser Thr Cys Leu Glu Leu Phe Lys 

625 630 635 640 

ttc acc ate ggc atg ggc gac ctg gag ttc act gag aac tat gac ttc 



1824 



1872 



1920 



1968 
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Phe Thr lie Gly Met Gly Asp Leu Glu Phe Thr Glu Asn Tyr Asp Phe 
645 650 655 

aag get gtc ttc ate ate ctg ctg ctg gec tat gta att etc acc tac 2016 
Lys Ala Val Phe lie lie Leu Leu Leu Ala Tyr Val lie Leu Thr Tyr 
660 665 670 

ate etc ctg etc aac atg etc ate gee etc atg ggt gag act gtc aac 2064 
lie Leu Leu Leu Asn Met Leu lie Ala Leu Met Gly Glu Thr Val Asn 
675 680 685 

aag ate gca cag gag age aag aac ate tgg aag ctg cag aga gee ate 2112 
Lys lie Ala Gin Glu Ser Lys Asn lie Trp Lys Leu Gin Arg Ala He 
690 695 700 

acc ate ctg gac acg gag aag age ttc ctt aag tgc atg agg aag gee 2160 
Thr He Leu Asp Thr Glu Lys Ser Phe Leu Lys Cys Met Arg Lys Ala 
705 710 715 720 

ttc cgc tea ggc aag ctg ctg cag gtg ggg tac aca cct gat ggc aag 2208 
Phe Arg Ser Gly Lys Leu Leu Gin Val Gly Tyr Thr Pro Asp Gly Lys 
725 730 735 

gac gac tac egg tgg tgc ttc agg gtg gac gag gtg aac tgg acc acc 2256 
Asp Asp Tyr Arg Trp Cys Phe Arg Val Asp Glu Val Asn Trp Thr Thr 
740 745 750 

tgg aac acc aac gtg ggc ate ate aac gaa gac ccg ggc aac tgt gag 2304 
Trp Asn Thr Asn Val Gly He He Asn Glu Asp Pro Gly Asn Cys Glu 
755 760 765 

ggc gtc aag cgc acc ctg age ttc tec ctg egg tea age aga gtt tea 2352 
Gly Val Lys Arg Thr Leu Ser Phe Ser Leu Arg Ser Ser Arg Val Ser 
770 775 780 

ggc aga cac tgg aag aac ttt gee ctg gtc ccc ctt tta aga gag gca 2400 
Gly Arg His Trp Lys Asn Phe Ala Leu Val Pro Leu Leu Arg Glu Ala 
785 790 795 800 

agt get cga gat agg cag tct get cag ccc gag gaa gtt tat ctg cga 2448 
Ser Ala Arg Asp Arg Gin Ser Ala Gin Pro Glu Glu Val Tyr Leu Arg 
805 810 815 



cag ttt tea ggg tct ctg aag cca gag gac get gag gtc ttc aag agt 2496 
Gin Phe Ser Gly Ser Leu Lys Pro Glu Asp Ala Glu Val Phe Lys Ser 
820 825 830 

cct gee get tec ggg gag aag 2517 
Pro Ala Ala Ser Gly Glu Lys 
835 

<210> 4 
<211> 2809 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
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<222> (361) . . (2652) 
<400> 4 

ggctagcctg tcctgacagg ggagagttaa gctcccgttc tccaccgtgc cggctggcca 60 
ggtgggctga gggtgaccga gagaccagaa cctgcttgct ggagcttagt gctcagagct 120 
ggggagggag gttccgccgc tcctctgctg tcagcgccgg cagcccctcc cggcttcact 180 
tcctcccgca gcccctgcta ctgagaagct ccgggatccc agcagccgcc acgccctggc 240 
ctcagcctgc ggggctccag tcaggccaac accgacgcgc agctgggagg aagacaggac 300 
ccttgacatc tccatctgca cagaggtcct ggctggaccg agcagcctcc tcctcctagg 360 



atg acc tea ccc tec age tct cca gtt ttc agg ttg gag aca tta gat 

Met Thr Ser Pro Ser Ser Ser Pro Val Phe Arg Leu Glu Thr Leu Asp 

15 10 15 

gga ggc caa gaa gat ggc tct gag gcg gac aga gga aag ctg gat ttt 

Gly Gly Gin Glu Asp Gly Ser Glu Ala Asp Arg Gly Lys Leu Asp Phe 

20 25 30 



408 



456 



ggg age ggg ctg cct ccc atg gag tea cag ttc cag ggc gag gac egg 504 
Gly Ser Gly Leu Pro Pro Met Glu Ser Gin Phe Gin Gly Glu Asp Arg 
35 40 45 

aaa ttc gec cct cag ata aga gtc aac etc aac tac cga aag gga aca 552 
Lys Phe Ala Pro Gin lie Arg Val Asn Leu Asn Tyr Arg Lys Gly Thr 
50 55 60 

ggt gee agt cag ccg gat cca aac cga ttt gac cga gat egg etc ttc 600 
Gly Ala Ser Gin Pro Asp Pro Asn Arg Phe Asp Arg Asp Arg Leu Phe 
65 70 75 80 



aat gcg gtc tec egg ggt gtc ccc gag gat ctg get gga ctt cca gag 

Asn Ala Val Ser Arg Gly Val Pro Glu Asp Leu Ala Gly Leu Pro Glu 

85 90 95 

tac ctg age aag acc age aag tac etc acc gac teg gaa tac aca gag 

Tyr Leu Ser Lys Thr Ser Lys Tyr Leu Thr Asp Ser Glu Tyr Thr Glu 

100 105 110 

ggc tec aca ggt aag acg tgc ctg atg aag get gtg ctg aac ctt aag 

Gly Ser Thr Gly Lys Thr Cys Leu Met Lys Ala Val Leu Asn Leu Lys 

115 120 125 

gac gga gtc aat gee tgc att ctg cca ctg ctg cag ate gac agg gac 

Asp Gly Val Asn Ala Cys lie Leu Pro Leu Leu Gin lie Asp Arg Asp 

130 135 140 

tct ggc aat cct cag ccc ctg gta aat gee cag tgc aca gat gac tat 

Ser Gly Asn Pro Gin Pro Leu Val Asn Ala Gin Cys Thr Asp Asp Tyr 

145 150 155 160 

tac cga ggc cac age get ctg cac ate gee att gag aag agg agt ctg 

Tyr Arg Gly His Ser Ala Leu His lie Ala lie Glu Lys Arg Ser Leu 



648 



696 



744 



792 



840 



888 
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165 170 175 

cag tgt gtg aag etc ctg gtg gag aat ggg gec aat gtg cat gec egg 936 
Gin Cys Val Lys Leu Leu Val Glu Asn Gly Ala Asn Val His Ala Arg 
180 185 190 

gec tgc ggc cgc ttc ttc cag aag ggc caa ggg act tgc ttt tat ttc 984 
Ala Cys Gly Arg Phe Phe Gin Lys Gly Gin Gly Thr Cys Phe Tyr Phe 
195 200 205 

ggt gag eta ccc etc tct ttg gee get tgc ace aag cag tgg gat gtg 1032 
Gly Glu Leu Pro Leu Ser Leu Ala Ala Cys Thr Lys Gin Trp Asp Val 
210 215 220 

gta age tac etc ctg gag aac cca cac cag ccc gee age ctg cag gee 1080 
Val Ser Tyr Leu Leu Glu Asn Pro His Gin Pro Ala Ser Leu Gin Ala 
225 230 235 240 

act gac tec cag ggc aac aca gtc ctg cat gee eta gtg atg ate teg 1128 
Thr Asp Ser Gin Gly Asn Thr Val Leu His Ala Leu Val Met He Ser 
245 250 255 

gac aac tea get gag aac att gca ctg gtg ace age atg tat gat ggg 1176 
Asp Asn Ser Ala Glu Asn He Ala Leu Val Thr Ser Met Tyr Asp Gly 
260 265 270 

etc etc caa get ggg gec cgc etc tgc cct ace gtg cag ctt gag gac 1224 
Leu Leu Gin Ala Gly Ala Arg Leu Cys Pro Thr Val Gin Leu Glu Asp 
275 280 285 

ate cgc aac ctg cag gat etc acg cct ctg aag ctg gee gee aag gag 1272 
He Arg Asn Leu Gin Asp Leu Thr Pro Leu Lys Leu Ala Ala Lys Glu 
290 295 300 

ggc aag ate gag att ttc agg cac ate ctg cag egg gag ttt tea gga 1320 
Gly Lys He Glu He Phe Arg His He Leu Gin Arg Glu Phe Ser Gly 
305 310 315 320 

ctg age cac ctt tec cga aag ttc ace gag tgg tgc tat ggg cct gtc 1368 
Leu Ser His Leu Ser Arg Lys Phe Thr Glu Trp Cys Tyr Gly Pro Val 
325 330 335 

egg gtg teg ctg tat gac ctg get tct gtg gac age tgt gag gag aac 1416 
Arg Val Ser Leu Tyr Asp Leu Ala Ser Val Asp Ser Cys Glu Glu Asn 
340 345 350 

tea gtg ctg gag ate att gec ttt cat tgc aag age ccg cac cga cac 1464 
Ser Val Leu Glu He He Ala Phe His Cys Lys Ser Pro His Arg His 
355 360 365 

cga atg gtc gtt ttg gag ccc ctg aac aaa ctg ctg cag gcg aaa tgg 1512 
Arg Met Val Val Leu Glu Pro Leu Asn Lys Leu Leu Gin Ala Lys Trp 
370 375 380 

gat ctg etc ate ccc aag ttc ttc tta aac ttc ctg tgt aat ctg ate 1560 
Asp Leu Leu He Pro Lys Phe Phe Leu Asn Phe Leu Cys Asn Leu lie 
385 390 395 400 
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tac atg ttc ate ttc acc get gtt gec tac cat cag cct ace ctg aag 

Tyr Met Phe lie Phe Thr Ala Val Ala Tyr His Gin Pro Thr Leu Lys 
405 410 415 

aag cag gee gec cct cac ctg aaa gcg gag gtt gga aac tec atg ctg 

Lys Gin Ala Ala Pro His Leu Lys Ala Glu Val Gly Asn Ser Met Leu 

420 425 430 

ctg acg ggc cac ate ctt ate ctg eta ggg ggg ate tac etc etc gtg 

Leu Thr Gly His lie Leu lie Leu Leu Gly Gly lie Tyr Leu Leu Val 
435 440 445 



1608 



1656 



1704 



ggc cag ctg tgg tac ttc tgg egg cgc cac gtg ttc ate tgg ate teg 1752 

Gly Gin Leu Trp Tyr Phe Trp Arg Arg His Val Phe lie Trp lie Ser 

450 455 460 

ttc ata gac age tac ttt gaa ate etc ttc ctg ttc cag gee ctg etc 1800 

Phe lie Asp Ser Tyr Phe Glu lie Leu Phe Leu Phe Gin Ala Leu Leu 

465 470 475 480 

aca gtg gtg tec cag gtg ctg tgt ttc ctg gee ate gag tgg tac ctg 

Thr Val Val Ser Gin Val Leu Cys Phe Leu Ala lie Glu Trp Tyr Leu 

485 490 495 



1848 



ccc ctg ctt gtg tct gcg ctg gtg ctg ggc tgg ctg aac ctg ctt tac 1896 
Pro Leu Leu Val Ser Ala Leu Val Leu Gly Trp Leu Asn Leu Leu Tyr 
500 505 510 



tat aca cgt ggc ttc cag cac aca ggc ate tac agt gtc atg ate cag 1944 

Tyr Thr Arg Gly Phe Gin His Thr Gly lie Tyr Ser Val Met lie Gin 

515 520 525 

aag gtc ate ctg egg gac ctg ctg cgc ttc ctt ctg ate tac tta gtc 1992 

Lys Val lie Leu Arg Asp Leu Leu Arg Phe Leu Leu lie Tyr Leu Val 
530 535 540 

ttc ctt ttc ggc ttc get gta gee ctg gtg age ctg age cag gag get 2040 

Phe Leu Phe Gly Phe Ala Val Ala Leu Val Ser Leu Ser Gin Glu Ala 

545 550 555 560 



tgg cgc ccc gaa get cct aca ggc ccc aat gee aca gag tea gtg cag 
Trp Arg Pro Glu Ala Pro Thr Gly Pro Asn Ala Thr Glu Ser Val Gin 
565 570 575 



2088 



ccc atg gag gga cag gag gac gag ggc aac ggg gee cag tac agg ggt 2136 

Pro Met Glu Gly Gin Glu Asp Glu Gly Asn Gly Ala Gin Tyr Arg Gly 

580 585 590 

ate ctg gaa gee tec ttg gag etc ttc aaa ttc acc ate ggc atg ggc 2184 

lie Leu Glu Ala Ser Leu Glu Leu Phe Lys Phe Thr lie Gly Met Gly 

595 600 605 



gag ctg gee ttc cag gag cag ctg cac ttc cgc ggc atg gtg ctg ctg 

Glu Leu Ala Phe Gin Glu Gin Leu His Phe Arg Gly Met Val Leu Leu 

610 615 620 

ctg ctg ctg gee tac gtg ctg etc acc tac ate ctg ctg etc aac atg 



2232 



2280 
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Leu Leu Leu Ala Tyr Val Leu Leu Thr Tyr lie Leu Leu Leu Asn Met 
625 630 635 640 

etc ate gee etc atg age gag ace gtc aac agt gtc gee act gac age 2328 
Leu He Ala Leu Met Ser Glu Thr Val Asn Ser Val Ala Thr Asp Ser 
645 650 655 

tgg age ate tgg aag ctg cag aaa gee ate tct gtc ctg gag atg gag 2376 
Trp Ser He Trp Lys Leu Gin Lys Ala He Ser Val Leu Glu Met Glu 
660 665 670 

aat ggc tat tgg tgg tgc agg aag aag cag egg gca ggt gtg atg ctg 2424 
Asn Gly Tyr Trp Trp Cys Arg Lys Lys Gin Arg Ala Gly Val Met Leu 
675 680 685 

acc gtt ggc act aag cca gat ggc age ccg gat gag cgc tgg tgc ttc 2472 
Thr Val Gly Thr Lys Pro Asp Gly Ser Pro Asp Glu Arg Trp Cys Phe 
690 695 700 

agg gtg gag gag gtg aac tgg get tea tgg gag cag acg ctg cct acg 2520 
Arg Val Glu Glu Val Asn Trp Ala Ser Trp Glu Gin Thr Leu Pro Thr 
705 710 715 720 

ctg tgt gag gac ccg tea ggg gca ggt gtc cct cga act etc gag aac 2568 
Leu Cys Glu Asp Pro Ser Gly Ala Gly Val Pro Arg Thr Leu Glu Asn 
725 730 735 

cct gtc ctg get tec cct ccc aag gag gat gag gat ggt gee tct gag 2 616 
Pro Val Leu Ala Ser Pro Pro Lys Glu Asp Glu Asp Gly Ala Ser Glu 
740 745 750 

gaa aac tat gtg ccc gtc cag etc etc cag tec aac tgatggccca 2662 
Glu Asn Tyr Val Pro Val Gin Leu Leu Gin Ser Asn 
755 760 

gatgeagcag gaggecagag gacagagcag aggatctttc caaccacatc tgctggctct 2722 

ggggtcccag tgaattctgg tggcaaatat atattttcac taactcaaaa aaaaaaaaaa 2782 

aaaaaaaaaa aaaaaaaaaa aaaaaaa 2809 



<210> 5 

<211> 764 

<212> PRT 

<213> Homo sapiens 

<400> 5 

Met Thr Ser Pro Ser Ser Ser Pro Val Phe Arg Leu Glu Thr Leu Asp 
15 10 15 

Gly Gly Gin Glu Asp Gly Ser Glu Ala Asp Arg Gly Lys Leu Asp Phe 
20 25 30 

Gly Ser Gly Leu Pro Pro Met Glu Ser Gin Phe Gin Gly Glu Asp Arg 
35 40 45 

Lys Phe Ala Pro Gin He Arg Val Asn Leu Asn Tyr Arg Lys Gly Thr 
50 55 60 
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Gly Ala Ser Gin Pro 
65 

Asn Ala Val Ser Arg 
85 

Tyr Leu Ser Lys Thr 
100 

Gly Ser Thr Gly Lys 
115 

Asp Gly Val Asn Ala 
130 

Ser Gly Asn Pro Gin 
145 

Tyr Arg Gly His Ser 
165 

Gin Cys Val Lys Leu 
180 

Ala Cys Gly Arg Phe 
195 

Gly Glu Leu Pro Leu 
210 

Val Ser Tyr Leu Leu 
225 

Thr Asp Ser Gin Gly 
245 



Asp Pro Asn Arg Phe Asp 
70 75 

Gly Val Pro Glu Asp Leu 
90 

Ser Lys Tyr Leu Thr Asp 
105 

Thr Cys Leu Met Lys Ala 
120 

Cys lie Leu Pro Leu Leu 
135 

Pro Leu Val Asn Ala Gin 
150 155 

Ala Leu His lie Ala lie 
170 

Leu Val Glu Asn Gly Ala 
185 

Phe Gin Lys Gly Gin Gly 
200 

Ser Leu Ala Ala Cys Thr 
215 

Glu Asn Pro His Gin Pro 
230 235 

Asn Thr Val Leu His Ala 
250 



Arg Asp Arg Leu Phe 
80 

Ala Gly Leu Pro Glu 
95 

Ser Glu Tyr Thr Glu 
110 

Val Leu Asn Leu Lys 
125 

Gin lie Asp Arg Asp 
140 

Cys Thr Asp Asp Tyr 
160 

Glu Lys Arg Ser Leu 
175 

Asn Val His Ala Arg 
190 

Thr Cys Phe Tyr Phe 
205 

Lys Gin Trp Asp Val 
220 

Ala Ser Leu Gin Ala 
240 

Leu Val Met lie Ser 
255 



Asp Asn Ser Ala 
260 

Leu Leu Gin Ala 
275 

lie Arg Asn Leu 
290 

Gly Lys lie Glu 
305 

Leu Ser His Leu 



Arg Val Ser Leu 
340 

Ser Val Leu Glu 
355 



Glu Asn lie Ala 



Gly Ala Arg Leu 
280 

Gin Asp Leu Thr 
295 

lie Phe Arg His 
310 

Ser Arg Lys Phe 
325 

Tyr Asp Leu Ala 



lie He Ala Phe 
360 



Leu Val Thr Ser 
265 

Cys Pro Thr Val 



Pro Leu Lys Leu 
300 

He Leu Gin Arg 
315 

Thr Glu Trp Cys 
330 

Ser Val Asp Ser 
345 

His Cys Lys Ser 



Met Tyr Asp Gly 
270 

Gin Leu Glu Asp 
285 

Ala Ala Lys Glu 



Glu Phe Ser Gly 
320 

Tyr Gly Pro Val 
335 

Cys Glu Glu Asn 
350 

Pro His Arg His 
365 
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Arg Met Val Val Leu Glu Pro Leu Asn Lys Leu Leu Gin Ala Lys Trp 
370 375 380 

Asp Leu Leu lie Pro Lys Phe Phe Leu Asn Phe Leu Cys Asn Leu lie 
385 390 395 400 

Tyr Met Phe lie Phe Thr Ala Val Ala Tyr His Gin Pro Thr Leu Lys 
405 410 415 

Lys Gin Ala Ala Pro His Leu Lys Ala Glu Val Gly Asn Ser Met Leu 
420 425 430 

Leu Thr Gly His lie Leu lie Leu Leu Gly Gly lie Tyr Leu Leu Val 
435 440 445 

Gly Gin Leu Trp Tyr Phe Trp Arg Arg His Val Phe lie Trp lie Ser 
450 455 460 

Phe lie Asp Ser Tyr Phe Glu lie Leu Phe Leu Phe Gin Ala Leu Leu 
465 470 475 480 

Thr Val Val Ser Gin Val Leu Cys Phe Leu Ala lie Glu Trp Tyr Leu 
485 490 495 

Pro Leu Leu Val Ser Ala Leu Val Leu Gly Trp Leu Asn Leu Leu Tyr 
500 505 510 

Tyr Thr Arg Gly Phe Gin His Thr Gly lie Tyr Ser Val Met lie Gin 
515 520 525 

Lys Val lie Leu Arg Asp Leu Leu Arg Phe Leu Leu lie Tyr Leu Val 
530 535 540 

Phe Leu Phe Gly Phe Ala Val Ala Leu Val Ser Leu Ser Gin Glu Ala 
545 550 555 560 

Trp Arg Pro Glu Ala Pro Thr Gly Pro Asn Ala Thr Glu Ser Val Gin 
565 570 575 

Pro Met Glu Gly Gin Glu Asp Glu Gly Asn Gly Ala Gin Tyr Arg Gly 
580 585 590 

lie Leu Glu Ala Ser Leu Glu Leu Phe Lys Phe Thr lie Gly Met Gly 
595 600 605 

Glu Leu Ala Phe Gin Glu Gin Leu His Phe Arg Gly Met Val Leu Leu 
610 615 620 

Leu Leu Leu Ala Tyr Val Leu Leu Thr Tyr lie Leu Leu Leu Asn Met 
625 630 635 640 

Leu lie Ala Leu Met Ser Glu Thr Val Asn Ser Val Ala Thr Asp Ser 
645 650 655 

Trp Ser lie Trp Lys Leu Gin Lys Ala lie Ser Val Leu Glu Met Glu 
660 665 670 

Asn Gly Tyr Trp Trp Cys Arg Lys Lys Gin Arg Ala Gly Val Met Leu 
675 680 685 
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Thr Val Gly Thr 
690 

Arg Val Glu Glu 
705 

Leu Cys Glu Asp 



Pro Val Leu Ala 
740 

Glu Asn Tyr Val 
755 



Lys Pro Asp Gly 
695 

Val Asn Trp Ala 
710 

Pro Ser Gly Ala 
725 

Ser Pro Pro Lys 



Pro Val Gin Leu 
760 



Ser Pro Asp Glu 
700 

Ser Trp Glu Gin 
715 

Gly Val Pro Arg 
730 

Glu Asp Glu Asp 
745 

Leu Gin Ser Asn 



Arg Trp Cys Phe 



Thr Leu Pro Thr 
720 

Thr Leu Glu Asn 
735 

Gly Ala Ser Glu 
750 



<210> 6 
<211> 2292 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . (2292) 

<400> 6 

atg acc tea ccc tec age tct cca gtt ttc agg ttg gag aca tta gat 48 

Met Thr Ser Pro Ser Ser Ser Pro Val Phe Arg Leu Glu Thr Leu Asp 

15 10 15 

gga ggc caa gaa gat ggc tct gag gcg gac aga gga aag ctg gat ttt 96 
Gly Gly Gin Glu Asp Gly Ser Glu Ala Asp Arg Gly Lys Leu Asp Phe 
20 25 30 



ggg age ggg ctg cct ccc atg gag tea cag ttc cag ggc gag gac egg 144 
Gly Ser Gly Leu Pro Pro Met Glu Ser Gin Phe Gin Gly Glu Asp Arg 
35 40 45 

aaa ttc gee cct cag ata aga gtc aac etc aac tac cga aag gga aca 192 
Lys Phe Ala Pro Gin lie Arg Val Asn Leu Asn Tyr Arg Lys Gly Thr 
50 55 60 

ggt gee agt cag ccg gat cca aac cga ttt gac cga gat egg etc ttc 240 
Gly Ala Ser Gin Pro Asp Pro Asn Arg Phe Asp Arg Asp Arg Leu Phe 
65 70 75 80 

aat gcg gtc tec egg ggt gtc ccc gag gat ctg get gga ctt cca gag 288 
Asn Ala Val Ser Arg Gly Val Pro Glu Asp Leu Ala Gly Leu Pro Glu 
85 90 95 

tac ctg age aag acc age aag tac etc acc gac teg gaa tac aca gag 336 
Tyr Leu Ser Lys Thr Ser Lys Tyr Leu Thr Asp Ser Glu Tyr Thr Glu 
100 105 110 

ggc tec aca ggt aag acg tgc ctg atg aag get gtg ctg aac ctt aag 384 
Gly Ser Thr Gly Lys Thr Cys Leu Met Lys Ala Val Leu Asn Leu Lys 
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115 



120 



125 



gac gga gtc aat gcc tgc att ctg cca ctg ctg cag ate gac agg gac 432 
Asp Gly Val Asn Ala Cys lie Leu Pro Leu Leu Gin lie Asp Arg Asp 
130 135 140 

tct ggc aat cct cag ccc ctg gta aat gcc cag tgc aca gat gac tat 480 
Ser Gly Asn Pro Gin Pro Leu Val Asn Ala Gin Cys Thr Asp Asp Tyr 
145 150 155 160 

tac cga ggc cac age get ctg cac ate gcc att gag aag agg agt ctg 528 
Tyr Arg Gly His Ser Ala Leu His lie Ala lie Glu Lys Arg Ser Leu 
165 170 175 

cag tgt gtg aag etc ctg gtg gag aat ggg gcc aat gtg cat gcc egg 576 
Gin Cys Val Lys Leu Leu Val Glu Asn Gly Ala Asn Val His Ala Arg 
180 185 190 

gcc tgc ggc cgc ttc ttc cag aag ggc caa ggg act tgc ttt tat ttc 624 
Ala Cys Gly Arg Phe Phe Gin Lys Gly Gin Gly Thr Cys Phe Tyr Phe 
195 200 205 

ggc gag eta ccc etc tct ttg gcc get tgc acc aag cag tgg gat gtg 672 
Gly Glu Leu Pro Leu Ser Leu Ala Ala Cys Thr Lys Gin Trp Asp Val 
210 215 220 

gta age tac etc ctg gag aac cca cac cag ccc gcc age ctg cag gcc 720 
Val Ser Tyr Leu Leu Glu Asn Pro His Gin Pro Ala Ser Leu Gin Ala 
225 230 235 240 

act gac tec cag ggc aac aca gtc ctg cat gcc eta gtg atg ate teg 768 
Thr Asp Ser Gin Gly Asn Thr Val Leu His Ala Leu Val Met lie Ser 
245 250 255 

gac aac tea get gag aac att gca ctg gtg acc age atg tat gat ggg 816 
Asp Asn Ser Ala Glu Asn lie Ala Leu Val Thr Ser Met Tyr Asp Gly 
260 265 270 



etc etc caa get ggg gcc cgc etc tgc cct acc gtg cag ctt gag gac 864 

Leu Leu Gin Ala Gly Ala Arg Leu Cys Pro Thr Val Gin Leu Glu Asp 

275 280 285 

ate cgc aac ctg cag gat etc acg cct ctg aag ctg gcc gcc aag gag 912 

lie Arg Asn Leu Gin Asp Leu Thr Pro Leu Lys Leu Ala Ala Lys Glu 
290 295 300 

ggc aag ate gag att ttc agg cac ate ctg cag egg gag ttt tea gga 960 

Gly Lys lie Glu lie Phe Arg His lie Leu Gin Arg Glu Phe Ser Gly 
305 310 315 320 

ctg age cac ctt tec cga aag ttc acc gag tgg tgc tat ggg cct gtc 1008 

Leu Ser His Leu Ser Arg Lys Phe Thr Glu Trp Cys Tyr Gly Pro Val 

325 330 335 

egg gtg teg ctg tat gac ctg get tct gtg gac age tgt gag gag aac 1056 

Arg Val Ser Leu Tyr Asp Leu Ala Ser Val Asp Ser Cys Glu Glu Asn 
340 345 350 
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tea gtg ctg 
Ser Val Leu 
355 

cga atg gtc 
Arg Met Val 
370 

gat ctg etc 
Asp Leu Leu 
385 

tac atg ttc 
Tyr Met Phe 



aag cag gee 
Lys Gin Ala 



ctg acg ggc 
Leu Thr Gly 
435 

ggc cag ctg 
Gly Gin Leu 
450 

ttc ata gac 
Phe lie Asp 
465 



aca gtg gtg 
Thr Val Val 



ccc ctg ctt 
Pro Leu Leu 



tat aca cgt 
Tyr Thr Arg 
515 

aag gtc ate 
Lys Val He 
530 

ttc ctt ttc 
Phe Leu Phe 
545 

tgg cgc ccc 
Trp Arg Pro 



ccc atg gag 
Pro Met Glu 



gag ate att 
Glu He He 



gtt ttg gag 
Val Leu Glu 



ate ccc aag 
He Pro Lys 
390 

ate ttc acc 
He Phe Thr 
405 

gec cct cac 
Ala Pro His 
420 

cac ate ctt 
His He Leu 



tgg tac ttc 
Trp Tyr Phe 



age tac ttt 
Ser Tyr Phe 
470 



tec cag gtg 
Ser Gin Val 
485 

gtg tct gcg 
Val Ser Ala 
500 

ggc ttc cag 
Gly Phe Gin 



ctg egg gac 
Leu Arg Asp 



ggc ttc get 
Gly Phe Ala 
550 

gaa get cct 
Glu Ala Pro 
565 

gga cag gag 
Gly Gin Glu 



gee ttt cat 
Ala Phe His 
360 

ccc ctg aac 
Pro Leu Asn 
375 

ttc ttc tta 
Phe Phe Leu 



get gtt gee 
Ala Val Ala 



ctg aaa gcg 
Leu Lys Ala 
425 

ate ctg eta 
lie Leu Leu 
440 

tgg egg cgc 
Trp Arg Arg 
455 

gaa ate etc 
Glu lie Leu 



ctg tgt ttc 
Leu Cys Phe 



ctg gtg ctg 
Leu Val Leu 
505 

cac aca ggc 
His Thr Gly 
520 

ctg ctg cgc 
Leu Leu Arg 
535 

gta gee ctg 
Val Ala Leu 



aca ggc ccc 
Thr Gly Pro 



gac gag ggc 
Asp Glu Gly 



-21 - 

tgc aag age 
Cys Lys Ser 



aaa ctg ctg 
Lys Leu Leu 
380 

aac ttc ctg 
Asn Phe Leu 
395 

tac cat cag 
Tyr His Gin 
410 

gag gtt gga 
Glu Val Gly 



ggg ggg ate 
Gly Gly He 



cac gtg ttc 
His Val Phe 
4 60 

ttc ctg ttc 
Phe Leu Phe 
475 



ctg gee ate 
Leu Ala He 
490 

ggc tgg ctg 
Gly Trp Leu 



ate tac agt 
lie Tyr Ser 



ttc ctt ctg 
Phe Leu Leu 
540 

gtg age ctg 
Val Ser Leu 
555 

aat gee aca 
Asn Ala Thr 
570 

aac ggg gee 
Asn Gly Ala 



ccg cac cga 
Pro His Arg 
365 

cag gcg aaa 
Gin Ala Lys 



tgt aat ctg 
Cys Asn Leu 



cct acc ctg 
Pro Thr Leu 
415 

aac tec atg 
Asn Ser Met 
430 

tac etc etc 
Tyr Leu Leu 
445 

ate tgg ate 
He Trp He 



cag gee ctg 
Gin Ala Leu 



gag tgg tac 
Glu Trp Tyr 
495 

aac ctg ctt 
Asn Leu Leu 
510 

gtc atg ate 
Val Met He 
525 

ate tac tta 
He Tyr Leu 



age cag gag 
Ser Gin Glu 



gag tea gtg 
Glu Ser Val 
575 

cag tac agg 
Gin Tyr Arg 



cac 1104 
His 



tgg 1152 
Trp 



ate 1200 

He 

400 

aag 1248 
Lys 



ctg 1296 
Leu 



gtg 1344 
Val 



teg 1392 
Ser 



etc 1440 

Leu 

480 



ctg 1488 
Leu 



tac 1536 
Tyr 



cag 1584 
Gin 



gtc 1632 
Val 



get 1680 

Ala 

560 

cag 1728 
Gin 



g-gt 1776 
Gly 
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580 585 590 

ate ctg gaa gec tec ttg gag etc ttc aaa ttc ace ate ggc atg ggc 1824 
lie Leu Glu Ala Ser Leu Glu Leu Phe Lys Phe Thr lie Gly Met Gly 
595 600 605 

gag ctg gec ttc cag gag cag ctg cac ttc cgc ggc atg gtg ctg ctg 1872 
Glu Leu Ala Phe Gin Glu Gin Leu His Phe Arg Gly Met Val Leu Leu 
610 615 620 

ctg ctg ctg gec tac gtg ctg etc acc tac ate ctg ctg etc aac atg 1920 
Leu Leu Leu Ala Tyr Val Leu Leu Thr Tyr lie Leu Leu Leu Asn Met 
625 630 635 640 

etc ate gec etc atg age gag acc gtc aac agt gtc gec act gac age 1968 
Leu lie Ala Leu Met Ser Glu Thr Val Asn Ser Val Ala Thr Asp Ser 
645 650 655 

tgg age ate tgg aag ctg cag aaa gee ate tct gtc ctg gag atg gag 2016 
Trp Ser lie Trp Lys Leu Gin Lys Ala lie Ser Val Leu Glu Met Glu 
660 665 670 

aat ggc tat tgg tgg tgc agg aag aag cag egg gca ggt gtg atg ctg 2064 
Asn Gly Tyr Trp Trp Cys Arg Lys Lys Gin Arg Ala Gly Val Met Leu 
675 680 685 

acc gtt ggc act aag cca gat ggc age ccg gat gag cgc tgg tgc ttc 2112 
Thr Val Gly Thr Lys Pro Asp Gly Ser Pro Asp Glu Arg Trp Cys Phe 
690 695 700 



agg gtg gag gag gtg aac tgg get tea 
Arg Val Glu Glu Val Asn Trp Ala Ser 
705 710 

ctg tgt gag gac ccg tea ggg gca ggt 
Leu Cys Glu Asp Pro Ser Gly Ala Gly 
725 

cct gtc ctg get tec cct ccc aag gag 
Pro Val Leu Ala Ser Pro Pro Lys Glu 
740 745 

gaa aac tat gtg ccc gtc cag etc etc 
Glu Asn Tyr Val Pro Val Gin Leu Leu 
755 760 



tgg gag cag acg ctg cct acg 2160 
Trp Glu Gin Thr Leu Pro Thr 
715 720 

gtc cct cga act etc gag aac 2208 
Val Pro Arg Thr Leu Glu Asn 
730 735 

gat gag gat ggt gec tct gag 2256 
Asp Glu Asp Gly Ala Ser Glu 
750 

cag tec aac 2292 
Gin Ser Asn 



<210> 7 
<211> 1489 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (3) . . (1310) 

<400> 7 
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gc ggc cgc ttc ttc cag aag ggc caa ggg act tgc ttt tat ttc ggt 47 

Gly Arg Phe Phe Gin Lys Gly Gin Gly Thr Cys Phe Tyr Phe Gly 
15 10 15 

gag eta ccc etc tct ttg gec get tgc acc aag cag tgg gat gtg gta 95 
Glu Leu Pro Leu Ser Leu Ala Ala Cys Thr Lys Gin Trp Asp Val Val 
20 25 30 

age tac etc ctg gag aac cca cac cag ccc gec age ctg cag gee act 143 
Ser Tyr Leu Leu Glu Asn Pro His Gin Pro Ala Ser Leu Gin Ala Thr 
35 40 45 

gac tec cag ggc aac aca gtc ctg cat gec eta gtg atg ate teg gac 191 
Asp Ser Gin Gly Asn Thr Val Leu His Ala Leu Val Met lie Ser Asp 
50 55 60 

aac tea get gag aac att gca ctg gtg acc age atg tat gat ggg etc 239 
Asn Ser Ala Glu Asn lie Ala Leu Val Thr Ser Met Tyr Asp Gly Leu 
65 70 75 



etc caa get ggg gec cgc etc tgc cct acc gtg cag ctt gag gac ate 
Leu Gin Ala Gly Ala Arg Leu Cys Pro Thr Val Gin Leu Glu Asp lie 
80 85 90 95 



287 



cgc aac ctg cag gat etc acg cct ctg aag ctg gec gec aag gag ggc 335 
Arg Asn Leu Gin Asp Leu Thr Pro Leu Lys Leu Ala Ala Lys Glu Gly 
100 105 110 



431 



479 



527 



aag ate gag att ttc agg cac ate ctg cag egg gag ttt tea gga ctg 383 
Lys He Glu He Phe Arg His lie Leu Gin Arg Glu Phe Ser Gly Leu 
115 120 125 

age cac ctt tec cga aag ttc acc gag tgg tgc tat ggg cct gtc egg 
Ser His Leu Ser Arg Lys Phe Thr Glu Trp Cys Tyr Gly Pro Val Arg 
130 135 140 

gtg teg ctg tat gac ctg get tct gtg gac age tgt gag gag aac tea 
Val Ser Leu Tyr Asp Leu Ala Ser Val Asp Ser Cys Glu Glu Asn Ser 
145 150 155 

gtg ctg gag ate att gee ttt cat tgc aag age ccg cac cga cac cga 
Val Leu Glu He He Ala Phe His Cys Lys Ser Pro His Arg His Arg 
160 165 170 175 

atg gtc gtt ttg gag ccc ctg aac aaa ctg ctg cag gcg aaa tgg gat 575 
Met Val Val Leu Glu Pro Leu Asn Lys Leu Leu Gin Ala Lys Trp Asp 
180 185 190 

ctg etc ate ccc aag ttc ttc tta aac ttc ctg tgt aat ctg ate tac 623 
Leu Leu He Pro Lys Phe Phe Leu Asn Phe Leu Cys Asn Leu lie Tyr 
195 200 205 

atg ttc ate ttc acc get gtt gec tac cat cag cct acc ctg aag aag 
Met Phe He Phe Thr Ala Val Ala Tyr His Gin Pro Thr Leu Lys Lys 
210 215 220 

cag gec gee cct cac ctg aaa gcg gag gtt gga aac tec atg ctg ctg 719 
Gin Ala Ala Pro His Leu Lys Ala Glu Val Gly Asn Ser Met Leu Leu 



671 
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225 230 

acg ggc cac ate ctt ate ctg eta ggg 
Thr Gly His lie Leu lie Leu Leu Gly 
240 245 

cag ctg tgg tac ttc tgg egg cgc cac 
Gin Leu Trp Tyr Phe Trp Arg Arg His 
260 

ata gac age tac ttt gaa ate etc ttc 
lie Asp Ser Tyr Phe Glu lie Leu Phe 
275 280 

gtg gtg tec cag gtg ctg tgt ttc ctg 
Val Val Ser Gin Val^Leu Cys Phe Leu 
290 295 

ctg ctt gtg tct gcg ctg gtg ctg ggc 
Leu Leu Val Ser Ala Leu Val Leu Gly 
305 310 

aca cgt ggc ttc cag cac aca ggc ate 
Thr Arg Gly Phe Gin His Thr Gly lie 
320 325 



-24- 

235 

ggg ate tac etc etc gtg ggc 767 
Gly lie Tyr Leu Leu Val Gly 
250 255 

gtg ttc ate tgg ate teg ttc 815 
Val Phe lie Trp lie Ser Phe 
265 270 

ctg ttc cag gec ctg etc aca 863 
Leu Phe Gin Ala Leu Leu Thr 
285 

gec ate gag tgg tac ctg ccc 911 
Ala lie Glu Trp Tyr Leu Pro 
300 

tgg ctg aac ctg ctt tac tat 959 
Trp Leu Asn Leu Leu Tyr Tyr 
315 

tac agt gtc atg ate cag aag 1007 
Tyr Ser Val Met lie Gin Lys 
330 335 



aaa gee ate tct gtc ctg gag atg gag aat ggc tat tgg tgg tgc agg 1055 
Lys Ala lie Ser Val Leu Glu Met Glu Asn Gly Tyr Trp Trp Cys Arg 
340 345 350 

aag aag cag egg gca ggt gtg atg ctg ace gtt ggc act aag cca gat 1103 
Lys Lys Gin Arg Ala Gly Val Met Leu Thr Val Gly Thr Lys Pro Asp 
355 360 365 

ggc age ccg gat gag cgc tgg tgc ttc agg gtg gag gag gtg aac tgg 1151 
Gly Ser Pro Asp Glu Arg Trp Cys Phe Arg Val Glu Glu Val Asn Trp 
370 375 380 

get tea tgg gag cag acg ctg cct acg ctg tgt gag gac ccg tea ggg 1199 
Ala Ser Trp Glu Gin Thr Leu Pro Thr Leu Cys Glu Asp Pro Ser Gly 
385 390 395 

gca ggt gtc cct cga act etc gag aac cct gtc ctg get tec cct ccc 1247 
Ala Gly Val Pro Arg Thr Leu Glu Asn Pro Val Leu Ala Ser Pro Pro 
400 405 410 415 

aag gag gat gag gat ggt gee tct gag gaa aac tat gtg ccc gtc cag 1295 
Lys Glu Asp Glu Asp Gly Ala Ser Glu Glu Asn Tyr Val Pro Val Gin 
420 425 430 

etc etc cag tec aac tgatg-gecca gatgeagcag gaggecagag gacagagcag 1350 
Leu Leu Gin Ser Asn 
435 

aggatctttc caaccacatc tgctggctct ggggtcccag tgaattctgg tggcaaatat 1410 
atattttcac taactcaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaagg 1470 
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agcggacgcg tgggtcgac 1489 



<210> 8 
<211> 436 
<212> PRT 

<213> Homo sapiens 
<400> 8 

Gly Arg Phe Phe Gin Lys Gly Gin Gly Thr Cys Phe Tyr Phe Gly Glu 
15 10 15 

Leu Pro Leu Ser Leu Ala Ala Cys Thr Lys Gin Trp Asp Val Val Ser 
20 25 30 

Tyr Leu Leu Glu Asn Pro His Gin Pro Ala Ser Leu Gin Ala Thr Asp 
35 40 45 

Ser Gin Gly Asn Thr Val Leu His Ala Leu Val Met lie Ser Asp Asn 
50 55 60 

Ser Ala Glu Asn lie Ala Leu Val Thr Ser Met Tyr Asp Gly Leu Leu 
65 70 75 80 



Gin Ala Gly Ala 



Asn Leu Gin Asp 
100 

He Glu He Phe 
115 

His Leu Ser Arg 
130 

Ser Leu Tyr Asp 
145 

Leu Glu He He 



Val Val Leu Glu 
180 

Leu lie Pro Lys 
195 

Phe He Phe Thr 
210 

Ala Ala Pro His 
225 

Gly His He Leu 



Leu Trp Tyr Phe 



Arg Leu Cys Pro 
85 

Leu Thr Pro Leu 



Arg His He Leu 
120 

Lys Phe Thr Glu 
135 

Leu Ala Ser Val 
150 

Ala Phe His Cys 
165 

Pro Leu Asn Lys 



Phe Phe Leu Asn 
200 

Ala Val Ala Tyr 
215 

Leu Lys Ala Glu 
230 

lie Leu Leu Gly 
245 

Trp Arg Arg His 



Thr Val Gin Leu 
90 

Lys Leu Ala Ala 
105 

Gin Arg Glu Phe 



Trp Cys Tyr Gly 
140 

Asp Ser Cys Glu 
155 

Lys Ser Pro His 
170 

Leu Leu Gin Ala 
185 

Phe Leu Cys Asn 



His Gin Pro Thr 
220 

Val Gly Asn Ser 
235 

Gly lie Tyr Leu 
250 

Val Phe lie Trp 



Glu Asp lie Arg 
95 

Lys Glu Gly Lys 
110 

Ser Gly Leu Ser 
125 

Pro Val Arg Val 



Glu Asn Ser Val 
160 

Arg His Arg Met 
175 

Lys Trp Asp Leu 
190 

Leu He Tyr Met 
205 

Leu Lys Lys Gin 



Met Leu Leu Thr 
240 

Leu Val Gly Gin 
255 

He Ser Phe lie 
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260 265 270 

Asp Ser Tyr Phe Glu lie Leu Phe Leu Phe Gin Ala Leu Leu Thr Val 
275 280 285 

Val Ser Gin Val Leu Cys Phe Leu Ala lie Glu Trp Tyr Leu Pro Leu 
290 295 300 

Leu Val Ser Ala Leu Val Leu Gly Trp Leu Asn Leu Leu Tyr Tyr Thr 
305 310 315 320 

Arg Gly Phe Gin His Thr Gly lie Tyr Ser Val Met lie Gin Lys Lys 
325 330 335 

Ala lie Ser Val Leu Glu Met Glu Asn Gly Tyr Trp Trp Cys Arg Lys 
340 345 350 

Lys Gin Arg Ala Gly Val Met Leu Thr Val Gly Thr Lys Pro Asp Gly 
355 360 365 

Ser Pro Asp Glu Arg Trp Cys Phe Arg Val Glu Glu Val Asn Trp Ala 
370 375 380 

Ser Trp Glu Gin Thr Leu Pro Thr Leu Cys Glu Asp Pro Ser Gly Ala 
385 390 395 400 

Gly Val Pro Arg Thr Leu Glu Asn Pro Val Leu Ala Ser Pro Pro Lys 
405 410 415 

Glu Asp Glu Asp Gly Ala Ser Glu Glu Asn Tyr Val Pro Val Gin Leu 
420 425 430 



Leu Gin Ser Asn 
435 



<210> 9 
<211> 1308 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . (1308) 

<400> 9 

ggc cgc ttc ttc cag aag ggc caa ggg act tgc ttt tat ttc ggt gag 48 

Gly Arg Phe Phe Gin Lys Gly Gin Gly Thr Cys Phe Tyr Phe Gly Glu 
15 10 15 



eta ccc etc tct ttg gee get tgc ace aag cag tgg gat gtg gta age 96 
Leu Pro Leu Ser Leu Ala Ala Cys Thr Lys Gin Trp Asp Val Val Ser 
20 25 30 

tac etc ctg gag aac cca cac cag ccc gec age ctg cag gee act gac 144 
Tyr Leu Leu Glu Asn Pro His Gin Pro Ala Ser Leu Gin Ala Thr Asp 
35 40 45 
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tec cag ggc aac aca gtc ctg cat gec eta gtg atg'atc teg gac aac 192 
Ser Gin Gly Asn Thr Val Leu His Ala Leu Val Met lie Ser Asp Asn 
50 55 60 

tea get gag aac att gca ctg gtg ace age atg tat gat ggg etc etc 240 
Ser Ala Glu Asn lie Ala Leu Val Thr Ser Met Tyr Asp Gly Leu Leu 
65 70 75 80 

caa get ggg gee cgc etc tgc cct acc gtg cag ctt gag gac ate cgc 288 
Gin Ala Gly Ala Arg Leu Cys Pro Thr Val Gin Leu Glu Asp lie Arg 
85 90 95 

aac ctg cag gat etc acg cct ctg aag ctg gee gec aag gag ggc aag 336 
Asn Leu Gin Asp Leu Thr Pro Leu Lys Leu Ala Ala Lys Glu Gly Lys 
100 105 110 

ate gag att ttc agg cac ate ctg cag egg gag ttt tea gga ctg age 384 
lie Glu lie Phe Arg His lie Leu Gin Arg Glu Phe Ser Gly Leu Ser 
115 120 125 

cac ctt tec cga aag ttc acc gag tgg tgc tat ggg cct gtc egg gtg 432 
His Leu Ser Arg Lys Phe Thr Glu Trp Cys Tyr Gly Pro Val Arg Val 
130 135 140 

teg ctg tat gac ctg get tct gtg gac age tgt gag gag aac tea gtg 480 
Ser Leu Tyr Asp Leu Ala Ser Val Asp Ser Cys Glu Glu Asn Ser Val 
145 150 155 160 

ctg gag ate att gee ttt cat tgc aag age ccg cac cga cac cga atg 528 
Leu Glu lie lie Ala Phe His Cys Lys Ser Pro His Arg His Arg Met 
165 170 175 

gtc gtt ttg gag ccc ctg aac aaa ctg ctg cag gcg aaa tgg gat ctg 576 
Val Val Leu Glu Pro Leu Asn Lys Leu Leu Gin Ala Lys Trp Asp Leu 
180 185 190 

etc ate ccc aag ttc ttc tta aac ttc ctg tgt aat ctg ate tac atg 624 
Leu lie Pro Lys Phe Phe Leu Asn Phe Leu Cys Asn Leu lie Tyr Met 
195 200 205 

ttc ate ttc acc get gtt gec tac cat cag cct acc ctg aag aag cag 672 
Phe lie Phe Thr Ala Val Ala Tyr His Gin Pro Thr Leu Lys Lys Gin 
210 215 220 

gee gec cct cac ctg aaa gcg gag gtt gga aac tec atg ctg ctg acg 720 
Ala Ala Pro His Leu Lys Ala Glu Val Gly Asn Ser Met Leu Leu Thr 
225 230 235 240 

ggc cac ate ctt ate ctg eta ggg ggg ate tac etc etc gtg ggc cag 768 
Gly His lie Leu lie Leu Leu Gly Gly lie Tyr Leu Leu Val Gly Gin 
245 250 255 



ctg tgg tac ttc tgg egg cgc cac 
Leu Trp Tyr Phe Trp Arg Arg His 
260 

gac age tac ttt gaa ate etc ttc 
Asp Ser Tyr Phe Glu lie Leu Phe 



gtg ttc ate tgg ate teg ttc ata 816 
Val Phe lie Trp lie Ser Phe lie 
265 270 

ctg ttc cag gec ctg etc aca gtg 864 
Leu Phe Gin Ala Leu Leu Thr Val 
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275 280 285 

gtg tec cag gtg ctg tgt ttc ctg gec ate gag tgg tac ctg ccc ctg 912 
Val Ser Gin Val Leu Cys Phe Leu Ala lie Glu Trp Tyr Leu Pro Leu 
290 295 300 

ctt gtg tct gcg ctg gtg ctg ggc tgg ctg aac ctg ctt tac tat aca 960 
Leu Val Ser Ala Leu Val Leu Gly Trp Leu Asn Leu Leu Tyr Tyr Thr 
305 310 315 320 

cgt ggc ttc cag cac aca ggc ate tac agt gtc atg ate cag aag aaa 1008 
Arg Gly Phe Gin His Thr Gly lie Tyr Ser Val Met lie Gin Lys Lys 
325 330 335 

gec ate tct gtc ctg gag atg gag aat ggc tat tgg tgg tgc agg aag 1056 
Ala lie Ser Val Leu Glu Met Glu Asn Gly Tyr Trp Trp Cys Arg Lys 
340 345 350 



aag cag egg gca ggt gtg atg ctg ace gtt ggc act aag cca gat ggc 1104 
Lys Gin Arg Ala Gly Val Met Leu Thr Val Gly Thr Lys Pro Asp Gly 
355 360 365 

age ccg gat gag cgc tgg tgc ttc agg gtg gag gag gtg aac tgg get 1152 
Ser Pro Asp Glu Arg Trp Cys Phe Arg Val Glu Glu Val Asn Trp Ala 
370 375 380 

tea tgg gag cag acg ctg cct acg ctg tgt gag gac ccg tea ggg gca 1200 
Ser Trp Glu Gin Thr Leu Pro Thr Leu Cys Glu Asp Pro Ser Gly Ala 
385 390 395 400 

ggt gtc cct cga act etc gag aac cct gtc ctg get tec cct ccc aag 1248 
Gly Val Pro Arg Thr Leu Glu Asn Pro Val Leu Ala Ser Pro Pro Lys 
405 410 415 

gag gat gag gat ggt gee tct gag gaa aac tat gtg ccc gtc cag etc 1296 
Glu Asp Glu Asp Gly Ala Ser Glu Glu Asn Tyr Val Pro Val Gin Leu 
420 425 430 

etc cag tec aac 1308 
Leu Gin Ser Asn 
435 



<210> 10 
<211> 1794 
<212> DNA 
<213> Rattus sp. 
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<220> 

<221> CDS 

<222> (2) . . (1663) 

<400> 10 

g teg acc cac gcg tec get ctt tct ctg get gcg tgc ace aag cag tgg 49 

Ser Thr His Ala Ser Ala Leu Ser Leu Ala Ala Cys Thr Lys Gin Trp 
15 10 15 

gat gtg gtg acc tac etc ctg gag aac cca cac cag ccg gee age ctg 97 
Asp Val Val Thr Tyr Leu Leu Glu Asn Pro His Gin Pro Ala Ser Leu 
20 25 30 

gag qcc acc gac tec ctg ggc aac aca gtc ctg cat get ctg gta atg 145 
Glu Ala Thr Asp Ser Leu Gly Asn Thr Val Leu His Ala Leu Val Met 
35 40 45 

att cca gat aac teg cct gag aac agt gec ctg gtg ate cac atg tac 193 
He Ala Asp Asn Ser Pro Glu Asn Ser Ala Leu Val lie His Met Tyr 
50 55 60 

no c ac:q ctt eta caa atg ggg gcg cgc etc tgc ccc act gtg cag ctt 241 
Asp G)y Leu Leu Gin Met Gly Ala Arg Leu Cys Pro Thr Val Gin Leu 
65 70 75 80 

gag gaa ate tec aac cac caa ggc etc aca ccc ctg aaa eta gec gee 289 
Glu Glu He Ser Asn His Gin Gly Leu Thr Pro Leu Lys Leu Ala Ala 
85 90 95 

aag gaa ggc aaa ate gag att ttc agg cac att ctg cag egg gaa ttc 337 
Lys Glu Gly Lys He Glu He Phe Arg His lie Leu Gin Arg Glu Phe 
100 105 110 

tea gga ccg tac cag ccc ctt tec cga aag ttt act gag tgg tgt tac 385 
Ser Gly Pro Tyr Gin Pro Leu Ser Arg Lys Phe Thr Glu Trp Cys Tyr 
115 120 125 

ggt cct gtg egg gta teg ctg tac gac ctg tec tct gtg gac age tgg 433 
Gly Pro Val Arg Val Ser Leu Tyr Asp Leu Ser Ser Val Asp Ser Trp 
130 135 140 

gaa aag aac teg gtg ctg gag ate ate get ttt cat tgc aag age ccg 481 
Glu Lys Asn Ser Val Leu Glu He lie Ala Phe His Cys Lys Ser Pro 
145 150 155 160 

aac egg cac cgc atg gtg gtt tta gaa cca ctg aac aag ctt ctg cag 529 
Asn Arg His Arg Met Val Val Leu Glu Pro Leu Asn Lys Leu Leu Gin 
165 170 175 

gag aaa tgg gat egg etc gtc tea aga ttc ttc ttc aac ttc gee tgc 577 
Glu Lys Trp Asp Arg Leu Val Ser Arg Phe Phe Phe Asn Phe Ala Cys 
180 185 190 

tac ttg gtc tac atg ttc ate ttc acc gtc gtt gee tac cac cag cct 625 
Tyr Leu Val Tyr Met Phe lie Phe Thr Val Val Ala Tyr His Gin Pro 
195 200 205 

tec ctg gat cag cca gee ate ccc tea tea aaa gcg act ttt ggg gaa 673 
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Ser Leu Asp Gin Pro Ala lie Pro Ser Ser Lys Ala Thr Phe Gly Glu 
210 215 220 

tec atg ctg ctg ctg ggc cac att ctg ate ctg ctt ggg ggt att tac 721 
Ser Met Leu Leu Leu Gly His lie Leu lie Leu Leu Gly Gly lie Tyr 
225 230 235 240 

etc tta ctg ggc cag ctg tgg tac ttt tgg egg egg cgc ctg ttt ate 769 
Leu Leu Leu Gly Gin Leu Trp Tyr Phe Trp Arg Arg Arg Leu Phe lie 
245 250 255 

tgg ate tea ttc atg gac age tac ttt gaa ate etc ttt etc ctt cag 817 
Trp lie Ser Phe Met Asp Ser Tyr Phe Glu lie Leu Phe Leu Leu Gin 
260 265 270 

get ctg etc aca gtg ctg tec cag gtg ctg cgc ttc atg gag act gaa 865 
Ala Leu Leu Thr Val Leu Ser Gin Val Leu Arg Phe Met Glu Thr Glu 
275 280 285 

tgg tac eta ccc ctg eta gtg tta tec eta gtg ctg ggc tgg ctg aac 913 
Trp Tyr Leu Pro Leu Leu Val Leu Ser Leu Val Leu Gly Trp Leu Asn 
290 295 300 



ctg ctt tac tac aca egg ggc ttt cag cac aca ggc ate tac agt gtc 961 
Leu Leu Tyr Tyr Thr Arg Gly Phe Gin His Thr Gly lie Tyr Ser Val 
305 310 315 320 

atg ate cag aag gtc ate ctt cga gac ctg etc cgt ttc ctg ctg gtc 1009 
Met lie Gin Lys Val lie Leu Arg Asp Leu Leu Arg Phe Leu Leu Val 
325 330 335 

tac ctg gtc ttc ctt ttc ggc ttt get gta gee eta gta age ttg age 1057 
Tyr Leu Val Phe Leu Phe Gly Phe Ala Val Ala Leu Val Ser Leu Ser 
340 345 350 

aga gag gee cga agt ccc aaa gee cct gaa gat aac aac tec aca gtg 1105 
Arg Glu Ala Arg Ser Pro Lys Ala Pro Glu Asp Asn Asn Ser Thr Val 
355 360 365 

acg gaa cag ccc acg gtg ggc cag gag gag gag cca get cca tat egg 1153 
Thr Glu Gin Pro Thr Val Gly Gin Glu Glu Glu Pro Ala Pro Tyr Arg 
370 375 380 

age att ctg gat gee tec eta gag ctg ttc aag ttc ace att ggt atg 1201 
Ser lie Leu Asp Ala Ser Leu Glu Leu Phe Lys Phe Thr lie Gly Met 
385 390 395 400 

ggg gag ctg get ttc cag gaa cag ctg cgt ttt cgt ggg gtg gtc ctg 124 9 
Gly Glu Leu Ala Phe Gin Glu Gin Leu Arg Phe Arg Gly Val Val Leu 
405 410 415 

ctg ttg ctg ttg gee tac gtc ctt etc ace tac gtc ctg ctg etc aac 1297 
Leu Leu Leu Leu Ala Tyr Val Leu Leu Thr Tyr Val Leu Leu Leu Asn 
420 425 430 

atg etc att get etc atg age gaa act gtc aac cac gtt get gac aac 1345 
Met Leu lie Ala Leu Met Ser Glu Thr Val Asn His Val Ala Asp Asn 
435 440 445 
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age tgg age ate tgg aag ttg cag aaa gee ate tct gtc ttg gag atg 1393 
Ser Trp Ser lie Trp Lys Leu Gin Lys Ala lie Ser Val Leu Glu Met 
450 455 460 



gag aat ggt tac tgg tgg tgc egg agg aag aaa cat cgt gaa ggg agg 
Glu Asn Gly Tyr Trp Trp Cys Arg Arg Lys Lys His Arg Glu Gly Arg 
465 470 475 480 



1441 



ctg ctg aaa gtc ggc acc agg ggg gat ggt acc cct gat gag cgc tgg 1489 
Leu Leu Lys Val Gly Thr Arg Gly Asp Gly Thr Pro Asp Glu Arg Trp 
485 490 495 

tgc ttc agg gtg gag gaa gta aat tgg get get tgg gag aag act ctt 1537 
Cys Phe Arg Val Glu Glu Val Asn Trp Ala Ala Trp Glu Lys Thr Leu 
500 505 510 

ccc acc tta tct gag gat cca tea ggg cca ggc ate act ggt aat aaa 1585 
Pro Thr Leu Ser Glu Asp Pro Ser Gly Pro Gly lie Thr Gly Asn Lys 
515 520 525 

aag aac cca acc tct aaa ccg ggg aag aac agt gec tea gag gaa gac 1633 
Lys Asn Pro Thr Ser Lys Pro Gly Lys Asn Ser Ala Ser Glu Glu Asp 
530 535 540 

cat ctg ccc ctt cag gtc etc cag tec ccc tgatggccca gatgeagcag 1683 
His Leu Pro Leu Gin Val Leu Gin Ser Pro 
545 550 

caggctggca ggatggagta gggaatcttc ccagccacac cagaggctac tgaattttgg 1743 
tggaaatata aatatttttt ttgcataaaa aaaaaaaaaa agggeggecg c 1794 

<210> 11 
<211> 554 
<212> PRT 
<213> Rattus sp. 

<400> 11 

Ser Thr His Ala Ser Ala Leu Ser Leu Ala Ala Cys Thr Lys Gin Trp 
15 10 15 

Asp Val Val Thr Tyr Leu Leu Glu Asn Pro His Gin Pro Ala Ser Leu 
20 25 30 

Glu Ala Thr Asp Ser Leu Gly Asn Thr Val Leu His Ala Leu Val Met 
35 40 45 

lie Ala Asp Asn Ser Pro Glu Asn Ser Ala Leu Val lie His Met Tyr 
50 55 60 

Asp Gly Leu Leu Gin Met Gly Ala Arg Leu Cys Pro Thr Val Gin Leu 
65 70 75 80 

Glu Glu lie Ser Asn His Gin Gly Leu Thr Pro Leu Lys Leu Ala Ala 
85 90 95 
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Lys Glu Gly Lys 
100 

Ser Gly Pro Tyr 
115 

Gly Pro Val Arg 
130 

Glu Lys Asn Ser 
145 

Asn Arg His Arg 



Glu Lys Trp Asp 
180 

Tyr Leu Val Tyr 

195 

Ser Leu Asp Gin 
210 

Ser Met Leu Leu 
225 

Leu Leu Leu Gly 



Trp He Ser Phe 
260 

Ala Leu Leu Thr 
275 

Trp Tyr Leu Pro 
290 

Leu Leu Tyr Tyr 
305 

Met He Gin Lys 



Tyr Leu Val Phe 
340 

Arg Glu Ala Arg 
355 

Thr Glu Gin Pro 
370 

Ser He Leu Asp 
385 

Gly Glu Leu Ala 



He Glu He Phe 



Gin Pro Leu Ser 
120 

Val Ser Leu Tyr 
135 

Val Leu Glu lie 
150 

Met Val Val Leu 
165 

Arg Leu Val Ser 



Met Phe He Phe 
200 

Pro Ala He Pro 
215 

Leu Gly His He 
230 

Gin Leu Trp Tyr 
245 

Met Asp Ser Tyr 



Val Leu Ser Gin 
280 

Leu Leu Val Leu 
295 

Thr Arg Gly Phe 
310 

Val He Leu Arg 
325 

Leu Phe Gly Phe 



Ser Pro Lys Ala 
360 

Thr Val Gly Gin 
375 

Ala Ser Leu Glu 
390 

Phe Gin Glu Gin 
405 



Arg His He Leu 
105 

Arg Lys Phe Thr 



Asp Leu Ser Ser 
140 

He Ala Phe His 
155 

Glu Pro Leu Asn 
170 

Arg Phe Phe Phe 
185 

Thr Val Val Ala 



Ser Ser Lys Ala 
220 

Leu lie Leu Leu 
235 

Phe Trp Arg Arg 
250 

Phe Glu lie Leu 
265 

Val Leu Arg Phe 



Ser Leu Val Leu 
300 

Gin His Thr Gly 
315 

Asp Leu Leu Arg 
330 

Ala Val Ala Leu 
345 

Pro Glu Asp Asn 



Glu Glu Glu Pro 
380 

Leu Phe Lys Phe 
395 

Leu Arg Phe Arg 
410 



Gin Arg Glu Phe 
110 

Glu Trp Cys Tyr 
125 

Val Asp Ser Trp 



Cys Lys Ser Pro 
160 

Lys Leu Leu Gin 
175 

Asn Phe Ala Cys 
190 

Tyr His Gin Pro 
205 

Thr Phe Gly Glu 



Gly Gly lie Tyr 
240 

Arg Leu Phe lie 
255 

Phe Leu Leu Gin 
270 

Met Glu Thr Glu 
285 

Gly Trp Leu Asn 



lie Tyr Ser Val 
320 

Phe Leu Leu Val 
335 

Val Ser Leu Ser 
350 

Asn Ser Thr Val 
365 

Ala Pro Tyr Arg 



Thr lie Gly Met 
400 

Gly Val Val Leu 
415 
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Leu Leu Leu Leu 
420 

Met Leu lie Ala 
435 

Ser Trp Ser lie 
450 

Glu Asn Gly Tyr 
4 65 

Leu Leu Lys Val 



Cys Phe Arg Val 
500 

Pro Thr Leu Ser 
515 

Lys Asn Pro Thr 
530 

His Leu Pro Leu 
545 



Ala Tyr Val Leu 



Leu Met Ser Glu 
440 

Trp Lys Leu Gin 
455 

Trp Trp Cys Arg 
470 

Gly Thr Arg Gly 
485 

Glu Glu Val Asn 



Glu Asp Pro Ser 
520 

Ser Lys Pro Gly 
535 

Gin Val Leu Gin 
550 



Leu Thr Tyr Val 
425 

Thr Val Asn His 



Lys Ala lie Ser 
460 

Arg Lys Lys His 
475 

Asp Gly Thr Pro 
490 

Trp Ala Ala Trp 
505 

Gly Pro Gly lie 



Lys Asn Ser Ala 
540 

Ser Pro 



Leu Leu Leu Asn 
430 

Val Ala Asp Asn 
445 

Val Leu Glu Met 



Arg Glu Gly Arg 
480 

Asp Glu Arg Trp 
495 

Glu Lys Thr Leu 
510 

Thr Gly Asn Lys 
525 

Ser Glu Glu Asp 



<210> 12 

<211> 1662 

<212> DNA 

<213> Rattus sp. 

<220> 

<221> CDS 

<222> (1) . • (1662) 

<400> 12 

teg acc cac gcg tec get ctt tct ctg get gcg tgc ace aag cag tgg 48 
Ser Thr His Ala Ser Ala Leu Ser Leu Ala Ala Cys Thr Lys Gin Trp 
15 10 15 

gat gtg gtg acc tac etc ctg gag aac cca cac cag ccg gec age ctg 96 
Asp Val Val Thr Tyr Leu Leu Glu Asn Pro His Gin Pro Ala Ser Leu 
20 25 30 

gag gec acc gac tec ctg ggc aac aca gtc ctg cat get ctg gta atg 144 
Glu Ala Thr Asp Ser Leu Gly Asn Thr Val Leu His Ala Leu Val Met 
35 40 45 

att gca gat aac teg cct gag aac agt gec ctg gtg ate cac atg tac 192 
lie Ala Asp Asn Ser Pro Glu Asn Ser Ala Leu Val lie His Met Tyr 
50 55 60 

gac ggg ctt eta caa atg ggg gcg cgc etc tgc ccc act gtg cag ctt 240 
Asp Gly Leu Leu Gin Met Gly Ala Arg Leu Cys Pro Thr Val Gin Leu 
65 70 75 80 
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gag gaa ate tec aac cac caa ggc etc aca ccc ctg aaa eta gee gec 288 
Glu Glu lie Ser Asn His Gin Gly Leu Thr Pro Leu Lys Leu Ala Ala 
85 90 95 

aag gaa ggc aaa ate gag att ttc agg cac att ctg cag egg gaa ttc 336 
Lys Glu Gly Lys lie Glu lie Phe Arg His He Leu Gin Arg Glu Phe 
100 105 110 

tea gga ccg tac cag ccc ctt tec cga aag ttt act gag tgg tgt tac 384 
Ser Gly Pro Tyr Gin Pro Leu Ser Arg Lys Phe Thr Glu Trp Cys Tyr 
115 120 125 

ggt cct gtg egg gta teg ctg tac gac ctg tec tct gtg gac age tgg 432 
Gly Pro Val Arg Val Ser Leu Tyr Asp Leu Ser Ser Val Asp Ser Trp 
130 135 140 

gaa aag aac teg gtg ctg gag ate ate get ttt cat tgc aag age ccg 480 
Glu Lys Asn Ser Val Leu Glu He He Ala Phe His Cys Lys Ser Pro 
145 150 155 160 

aac egg cac cgc atg gtg gtt tta gaa cca ctg aac aag ctt ctg cag 528 
Asn Arg His Arg Met Val Val Leu Glu Pro Leu Asn Lys Leu Leu Gin 
165 170 175 

gag aaa tgg gat egg etc gtc tea aga ttc ttc ttc aac ttc gec tgc 576 
Glu Lys Trp Asp Arg Leu Val Ser Arg Phe Phe Phe Asn Phe Ala Cys 
180 185 190 

tac ttg gtc tac atg ttc ate ttc ace gtc gtt gee tac cac cag cct 624 
Tyr Leu Val Tyr Met Phe He Phe Thr Val Val Ala Tyr His Gin Pro 
195 200 205 

tec ctg gat cag cca gee ate ccc tea tea aaa gcg act ttt ggg gaa 672 
Ser Leu Asp Gin Pro Ala lie Pro Ser Ser Lys Ala Thr Phe Gly Glu 
210 215 220 

tec atg ctg ctg ctg ggc cac att ctg ate ctg ctt ggg ggt att tac 720 
Ser Met Leu Leu Leu Gly His lie Leu He Leu Leu Gly Gly He Tyr 
225 230 235 240 

etc tta ctg ggc cag ctg tgg tac ttt tgg egg egg cgc ctg ttt ate 768 
Leu Leu Leu Gly Gin Leu Trp Tyr Phe Trp Arg Arg Arg Leu Phe lie 
245 250 255 

tgg ate tea ttc atg gac age tac ttt gaa ate etc ttt etc ctt cag 816 
Trp lie Ser Phe Met Asp Ser Tyr Phe Glu lie Leu Phe Leu Leu Gin 
260 265 270 

get ctg etc aca gtg ctg tec cag gtg ctg cgc ttc atg gag act gaa 864 
Ala Leu Leu Thr Val Leu Ser Gin Val Leu Arg Phe Met Glu Thr Glu 
275 280 285 

tgg tac eta ccc ctg eta gtg tta tec eta gtg ctg ggc tgg ctg aac 912 
Trp Tyr Leu Pro Leu Leu Val Leu Ser Leu Val Leu Gly Trp Leu Asn 
290 295 300 

ctg ctt tac tac aca egg ggc ttt cag cac aca ggc ate tac agt gtc 960 
Leu Leu Tyr Tyr Thr Arg Gly Phe Gin His Thr Gly lie Tyr Ser Val 
305 310 315 320 
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atg ate cag aag gtc ate ctt cga gac ctg etc cgt ttc ctg ctg gtc 1008 
Met lie Gin Lys Val lie Leu Arg Asp Leu Leu Arg Phe Leu Leu Val 
325 330 335 

tac ctg gtc ttc ctt ttc ggc ttt get gta gee eta gta age ttg age 1056 
Tyr Leu Val Phe Leu Phe Gly Phe Ala Val Ala Leu Val Ser Leu Ser 
340 345 350 

aga gag gec cga agt ccc aaa gec cct gaa gat aac aac tec aca gtg 1104 
Arg Glu Ala Arg Ser Pro Lys Ala Pro Glu Asp Asn Asn Ser Thr Val 
355 360 365 

acg gaa cag ccc acg gtg ggc cag gag gag gag cca get cca tat egg 1152 
Thr Glu Gin Pro Thr Val Gly Gin Glu Glu Glu Pro Ala Pro Tyr Arg 
370 375 380 

age att ctg gat gee tec eta gag ctg ttc aag ttc ace att ggt atg 1200 
Ser lie Leu Asp Ala Ser Leu Glu Leu Phe Lys Phe Thr lie Gly Met 
385 390 395 400 

ggg gag ctg get ttc cag gaa cag ctg cgt ttt cgt ggg gtg gtc ctg 1248 
Gly Glu Leu Ala Phe Gin Glu Gin Leu Arg Phe Arg Gly Val Val Leu 
405 410 415 

ctg ttg ctg ttg gee tac gtc ctt etc ace tac gtc ctg ctg etc aac 1296 
Leu Leu Leu Leu Ala Tyr Val Leu Leu Thr Tyr Val Leu Leu Leu Asn 
420 425 430 

atg etc att get etc atg age gaa act gtc aac cac gtt get gac aac 1344 
Met Leu lie Ala Leu Met Ser Glu Thr Val Asn His Val Ala Asp Asn 
435 440 445 

age tgg age ate tgg aag ttg cag aaa gee ate tct gtc ttg gag atg 1392 
Ser Trp Ser lie Trp Lys Leu Gin Lys Ala lie Ser Val Leu Glu Met 
450 455 460 

gag aat ggt tac tgg tgg tgc egg agg aag aaa cat cgt gaa ggg agg 1440 
Glu Asn Gly Tyr Trp Trp Cys Arg Arg Lys Lys His Arg Glu Gly Arg 
465 470 475 480 

ctg ctg aaa gtc ggc ace agg ggg gat ggt acc cct gat gag cgc tgg 1488 
Leu Leu Lys Val Gly Thr Arg Gly Asp Gly Thr Pro Asp Glu Arg Trp 
485 490 495 

tgc ttc agg gtg gag gaa gta aat tgg get get tgg gag aag act ctt 1536 
Cys Phe Arg Val Glu Glu Val Asn Trp Ala Ala Trp Glu Lys Thr Leu 
500 505 510 

ccc acc tta tct gag gat cca tea ggg cca ggc ate act ggt aat aaa 1584 
Pro Thr Leu Ser Glu Asp Pro Ser Gly Pro Gly lie Thr Gly Asn Lys 
515 520 525 

aag aac cca acc tct aaa ccg ggg aag aac agt gee tea gag gaa gac 1632 
Lys Asn Pro Thr Ser Lys Pro Gly Lys Asn Ser Ala Ser Glu Glu Asp 
530 535 540 

cat ctg ccc ctt cag gtc etc cag. tec ccc 1662 
His Leu Pro Leu Gin Val Leu Gin Ser Pro 
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545 550 



<210> 13 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
peptide 

<400> 13 

Ala Phe His Cys Lys Ser Pro His Arg His Arg Met Val Val Leu Glu 
15 10 15 



<210> 14 
<211> 25 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
peptide 

<400> 14 

Arg Pro Glu Ala Pro Thr Gly Pro Asn Ala Thr Glu Ser Val Gin Pro 
15 10 15 



Met Glu Gly Gin Glu Asp Glu Gly Asn 
20 25 

<210> 15 
<211> 19 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: synthetic 
peptide 

<400> 15 

Ser Val Leu Glu Met Glu Asn Gly Tyr Trp Trp Cys Arg Lys Lys Gin 
15 10 15 

Arg Ala Gly 



<210> 16 
<211> 20 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 16 

taggagaccc cgttgccacg 20 



<210> 17 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 17 

gattcacttg gggacagtga eg 22 

<210> 18 
<2il> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 18 

ttaagctccc gttctccacc g 21 



<210> 19 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 19 

getgegggag gaagtgaagc 20 



<210> 20 
<211> 630 
<212> PRT 

<213> Homo sapiens 
<400> 20 

Met Thr Ser Pro Ser Ser Ser Pro Val Phe Arg Leu Glu Thr Leu Asp 
15 10 15 

Gly Gly Gin Glu Asp Gly Ser Glu Ala Asp Arg Gly Lys Leu Asp Phe 
20 25 30 
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Gly Ser Gly Leu Pro Pro Met Glu Ser Gin Phe Gin Gly Glu Asp Arg 
35 40 45 

Lys Phe Ala Pro Gin lie Arg Val Asn Leu Asn Tyr Arg Lys Gly Thr 
50 55 60 

Gly Ala Ser Gin Pro Asp Pro Asn Arg Phe Asp Arg Asp Arg Leu Phe 
65 70 75 80 

Asn Ala Val Ser Arg Gly Val Pro Glu Asp Leu Ala Gly Leu Pro Glu 
85 90 95 



Tyr Leu Ser Lys Thr Ser Lys Tyr Leu Thr Asp Ser Glu Tyr Thr Glu 
100 105 110 

Gly Ser Thr Gly Lys Thr Cys Leu Met Lys Ala Val Leu Asn Leu Lys 
115 120 125 

Asp Gly Val Asn Ala Cys lie Leu Pro Leu Leu Gin lie Asp Arg Asp 
130 135 140 

Ser Gly Asn Pro Gin Pro Leu Val Asn Ala Gin Cys Thr Asp Asp Tyr 
145 150 155 160 

Tyr Arg Gly His Ser Ala Leu His lie Ala lie Glu Lys Arg Ser Leu 
165 170 175 

Gin Cys Val Lys Leu Leu Val Glu Asn Gly Ala Asn Val His Ala Arg 
180 185 190 

Ala Cys Gly Arg Phe Phe Gin Lys Gly Gin Gly Thr Cys Phe Tyr Phe 
195 200 205 

Gly Glu Leu Pro Leu Ser Leu Ala Ala Cys Thr Lys Gin Trp Asp Val 
210 215 220 

Val Ser Tyr Leu Leu Glu Asn Pro His Gin Pro Ala Ser Leu Gin Ala 
225 230 235 240 

Thr Asp Ser Gin Gly Asn Thr Val Leu His Ala Leu Val Met lie Ser 
245 250 255 

Asp Asn Ser Ala Glu Asn lie Ala Leu Val Thr Ser Met Tyr Asp Gly 
260 265 270 

Leu Leu Gin Ala Gly Ala Arg Leu Cys Pro Thr Val Gin Leu Glu Asp 
275 280 285 

lie Arg Asn Leu Gin Asp Leu Thr Pro Leu Lys Leu Ala Ala Lys Glu 
290 295 300 

Gly Lys lie Glu lie Phe Arg His lie Leu Gin Arg Glu Phe Ser Gly 
305 310 315 320 

Leu Ser His Leu Ser Arg Lys Phe Thr Glu Trp Cys Tyr Gly Pro Val 
325 330 335 
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Arg Val Ser Leu Tyr Asp Leu Ala Ser Val Asp Ser Cys Glu Glu Asn 
340 345 350 

Ser Val Leu Glu lie lie Ala Phe His Cys Lys Ser Pro His Arg His 
355 360 365 

Arg Met Val Val Leu Glu Pro Leu Asn Lys Leu Leu Gin Ala Lys Trp 
370 375 380 

Asp Leu Leu lie Pro Lys Phe Phe Leu Asn Phe Leu Cys Asn Leu lie 
385 390 395 400 

Tyr Met Phe lie Phe Thr Ala Val Ala Tyr His Gin Pro Thr Leu Lys 
405 410 415 

Lys Gin Ala Ala Pro His Leu Lys Ala Glu Val Gly Asn Ser Met Leu 
420 425 430 

Leu Thr Gly His lie Leu lie Leu Leu Gly Gly lie Tyr Leu Leu Val 
435 440 445 

Gly Gin Leu Trp Tyr Phe Trp Arg Arg His Val Phe lie Trp lie Ser 
450 455 460 

Phe lie Asp Ser Tyr Phe Glu lie Leu Phe Leu Phe Gin Ala Leu Leu 
465 470 475 480 

Thr Val Val Ser Gin Val Leu Cys Phe Leu Ala lie Glu Trp Tyr Leu 
485 490 495 

Pro Leu Leu Val Ser Ala Leu Val Leu Gly Trp Leu Asn Leu Leu Tyr 
500 505 510 

Tyr Thr Arg Gly Phe Gin His Thr Gly lie Tyr Ser Val Met lie Gin 
515 520 525 

Lys Lys Ala lie Ser Val Leu Glu Met Glu Asn Gly Tyr Trp Trp Cys 
530 535 540 

Arg Lys Lys Gin Arg Ala Gly Val Met Leu Thr Val Gly Thr Lys Pro 
545 550 555 560 

Asp Gly Ser Pro Asp Glu Arg Trp Cys Phe Arg Val Glu Glu Val Asn 
565 570 575 

Trp Ala Ser Trp Glu Gin Thr Leu Pro Thr Leu Cys Glu Asp Pro Ser 
580 585 590 

Gly Ala Gly Val Pro Arg Thr Leu Glu Asn Pro Val Leu Ala Ser Pro 
595 600 605 

Pro Lys Glu Asp Glu Asp Gly Ala Ser Glu Glu Asn Tyr Val Pro Val 
610 615 620 

Gin Leu Leu Gin Ser Asn 
625 630 
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